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ABSTRACT 

We explore the redshift evolution of the specific star formation rate (SSFR) for galaxies of different 
stellar mass by drawing on a deep 3.6 /mi-selected sample of > 10 5 galaxies in the 2 deg 2 COSMOS 
field. The average star formation rate (SFR) for sub-sets of these galaxies is estimated with stacked 
1.4 GHz radio continuum emission. We separately consider the total sample and a subset of galaxies 
that shows evidence for substantive recent star formation in the rest-frame optical spectral energy 
distributions. At redshifts 0.2 < z < 3 both populations show a strong and mass-independent decrease 
in their SSFR towards the present epoch. It is best described by a power- law (1 + z) n , where n ~ 4.3 
for all galaxies and n ~ 3.5 for star forming (SF) sources. The decrease appears to have started at 
z > 2, at least for high-mass (Af* >4x 10 10 M Q ) systems where our conclusions are most robust. 
Our data show that there is a tight correlation with power-law dependence, SSFR cx M*^, between 
SSFR and stellar mass at all epochs. The relation tends to flatten below M* ps 10 10 M© if quiescent 
galaxies are included; if they are excluded from the analysis a shallow index /?sfg ~ —0.4 fits the 
correlation. On average, higher mass objects always have lower SSFRs, also among SF galaxies. At 
z > 1.5 there is tentative evidence for an upper threshold in SSFR that an average galaxy cannot 
exceed, possibly due to gravitationally limited molecular gas accretion. It is suggested by a flattening 
of the SSFR-Af* relation (also for SF sources), but affects massive (> 10 10 M©) galaxies only at the 
highest redshifts. Since z = 1.5 there thus is no direct evidence that galaxies of higher mass experience 
a more rapid waning of their SSFR than lower mass SF systems. In this sense, the data rule out any 
strong 'downsizing' in the SSFR. We combine our results with recent measurements of the galaxy 
(stellar) mass function in order to determine the characteristic mass of a SF galaxy: we find that since 
z ~ 3 the majority of all new stars were always formed in galaxies of M* = io la6±0 - 4 M©. In this 
sense, too, there is no 'downsizing'. Finally, our analysis constitutes the most extensive SFR density 
determination with a single technique out to z = 3. Recent Herschel results are consistent with our 
results, but rely on far smaller samples. 

Subject headings: galaxies: evolution - surveys - radio continuum 



1. INTRODUCTION 

Over the last years multi- waveband surveys of various 
wide fields have lead to estimates of star formation rates 
(hereafter SFRs) and stellar masses for large numbers of 
galaxies out to high redshifts. Both quantities are crucial 

1 Max-Planck-Institut fur Astronomie, Konigstuhl 17, D- 
69117 Heidelberg, Germany [email: karim@mpia.de] 

2 Institute of Cosmology and Gravitation, University 
of Portsmouth, Dennis Sciama Building, Burnaby Road, 
Portsmouth, POl 3FX, United Kingdom 

3 Astrophysics, Department of Physics, University of Oxford, 
Keble Road, Oxford OX1 3RH, United Kingdom 

4 SEPnei, South-East Physics network 

5 Laboratoire d'Astrophysique de Marseille, BP 8, Traverse du 
Siphon, 13376 Marseille Cedex 12, France 

6 ESO ALMA COFUND Fellow, European Southern Observa- 
tory, Karl-Schwarzschild-Strasse 2, 85748 Garching b. Miinchen, 
Germany 

7 Argelander Institut for Astronomy, Auf dem Hiigel 71, Bonn, 
53121, Germany 

8 National Radio Astronomy Observatory, P.O. Box 0, So- 
corro, NM 87801-0387, USA 

9 Space Telescope Science Institute, 3700 San Martin Drive, 
Baltimore, MD 21218, USA 

10 Department of Astronomy, University of Michigan, 500 
Church Street, Ann Arbor, MI 48109, USA 

11 Max-Planck-Institut fur extraterrestrische Physik, Giessen- 
bachstrasse 1, D-85748 Garching, Germany 

12 International Max Planck Research School for Astronomy 
and Cosmic Physics at the University of Heidelberg 



for understanding galaxy evolution. On the one hand an 
evolution of the observed number density of galaxies as 
a function of stellar mass, i.e. the mass function, reveals 
how the stars are distributed among galaxies at differ- 
ent cosmic epochs. If, on the other hand, an increase 
in stellar mass of any population of galaxies can solely 
be explained by the rate at which new stars are formed 
within these systems or if other mechanisms are dom- 
inant can only be discussed if the corresponding SFRs 
themselves are known. 
A number of studies ( e.g. iLilly et al.l 119961: 



Madau et alJl!996t ICharv fc Elbadl2001t iLeFloc'h et all 



20051: ISmolcic et all 
Rodiahicro et all ~ 



2009a; 



.Dunne et al.l 
20101* iGruppioni et all 



2009; 



2010; 



iBouwens et all 120101; iRuiopakarn et all 120101 and 
for a compilation iHopkinsI 120041 and lHopkins &; Beacoml 
120061 ) revealed that the star formation rate density 
(hereafter SFRD), i.e. the SFR per unit comoving 
volume, rapidly declines over the last ~ 10 Gyr following 
the purported maximum of star formation activity in 
the universe. The question of whether the stellar mass 
content of galaxies could be a major driver for this 
decline has gained significant interest after the discovery 
of a tight correlation of SFR and stellar mass for star 
forming (hereafter SF) galaxie s with an intrinsic scatter 
of only about 0.3 dex (e.g. iBrinchmann et all 120041 ; 
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iNoeske et all l2007bl lElbaz et all [20071). This relation 
was s tudied in the loca l universe jBrinchmann et al.l 
120041 : iSalim etldl 120071) suggesting an apparent bi- 
modality in the SFR-M* plane if all galaxies are taken 
into account. It was also found to exist for SF galaxies 
at z < 1.2 (e.g . lNoeske"eTaIll20075 lElbaz et alJl2007t 
iBell et al.ll2007t iWalcher et all 120081) and furth er out to 
z w 2.5 ([Daddi et al.l[2007t IPannella et al.ll2l)09l )FI Con- 
sequently the stellar mass normalized SFR (hereafter 
specific SFR or SSFR), i.e. the SFR at a given epoch 
divided by the stellar mass the galaxy possesses at the 
same cosmic epoch, shows a tight (anti-)correlation. 

By studying the SSFR galaxies of different stellar 
masses can be directly compared. The SSFR itself de- 
fines a typical timescale that can be interpreted as a cur- 
rent efficiency of star formation within a galaxy com- 
pared to its past average star formatio n activity. The 
compilation of the studies mentioned (e.g. IPannella et al.l 
[20091: [GonTalez et aLlIlM IDutton etaJ]|2010D . not us- 
ing a common tracer for star formation nor selection 
technique for separating the SF galaxy fraction and data 
originating from various wide fields, suggests a steep evo- 
lution of the normalization of the SSFR-M* relation 1 ^ for 
SF galaxies. Studies, covering a broad dynamical range 
in stellar mass, have been carried out for all galaxies 
and confirmed the SSFR, as a function of redshift, to be 
even more rapidly in c reasing from z = to z sa 1 (e.g. 
Feulner et alTl2005bl: iZheng et all l2007al iDamen et all 
2009bi|20ipJ; as well as throughout an even wider range in 
redsh i ft (e.g. iFeulner et aLll2005at iPerez-Gonzalez et al.l 
I2008t IDunne et all I2009t IDamen et al.l I2009al ). It has 
been claimed that the steepness of the SSFR-increase 
with redshift might be a challenge for a cold dark matter 
concordance model (ACDM) suggested by comparisons 
to predictions from semi-analytical mode l s (SAMs) (see 
Santini et all 120091: IDamen et all l2009at iFirmani etaT] 
20101 and. e.gV. IGuo fc Whitell2008l for theoretical results 
based on a SAM). 

It was recent l y dis cussed by IStark et "aLl (|2009T > and 
I Gonzalez et al.l (j2010f ) . at least for moderately massive 
SF galaxies (M* ~5x 10 9 M ), that the rapidly evolv- 
ing SSFR might turn constant in the early universe. 
Their data show constant SSFRs up to the highest red- 
shift ranges [z ~ 7 — 8) probed so far. This signifi- 
cant deviation of the SSFR-evolution from a power-law 
(SSFR oc (1 + z) n ), fitting well the data below z « 2, 
could be a hint for different physical mec hanisms regulat- 
ing s tar formation in the early universe ([Gonzalez et al.l 
I2010I) . However, this deviation could also be a result of 
observational data significan tly underestimating the SS- 
FRs at these high redshifts (|Dutton et al.l [20101) caused 
by selection biases [3- Re cent theoretical models propose 
an enhanced merger rate (jKhochfar & Silk 2010) at high 

13 It needs to be mentioned that at z ~ 2 Erb et al. (2006) only 
found a weak correlation between SFR and stellar mass. However, 
their galaxy sample selection at ultraviolet wavelengths preferen- 
tially traces SFR rather than stellar mass, thus potentially biasing 
their results towards a flatter SFR-M, relation. 

14 In the following we will refer to this relation for SF galaxies 
as the SSFR-sequence. 

15 Note the very small number of galaxies currently stud- 
ied in the extreme high redshift regime. Also note the h ighly 
discrepant SSFR-estimates presented by [Yabc et al. ( 200^) and 
ISchaerc r & dc Barros (2010) at the most extreme redshifts as sum- 
marized by Bouchc et al. (2010) in their Fig. 13. 



z in order to account for the purported constancy of the 
SSFR. This is in contrast to pure steady cold-mode gas 
accretion above a limiting dark matter halo mass (the 
so-ca lled 'mass floor' of Mdm ~ 10 11 M Q ) (|Bouche et al.l 
l2010f ) reproducing well the observed slope of the SSFR- 
sequence at all z < 2. 

It was generally found that at z < 2 all galaxies show 
a significant (negative) slope of the SSFR-M* relation 
leading to lower SSFRs in more massive galaxies. Star 
forming galaxies also seem to show this behavior but the 
trend tends to be significantly weaker especially at z > 1 
where, based on the sBzK selection t echnique, the slope 
was found to be prac tically vanishing (jDaddi et alj|2007l : 
IPannella et al.l [20091) . It therefore is an ongoing debate 
if this phenomenon of a decreasing slope of the SFR- 
M* relation for SF galaxies with redshift is real or just 
an artifact (for an introduction and a s ummary of the 
confl icting observational results see e.g. iFontanot et al.l 
2009). This effect is commonly interpreted as star for- 
mation efficiency being shifted from higher mass ob- 
jects in the cosmic past to lower mass objects in the 
prese nt and sometimes referred to as 'cosmic downsiz- 
ing' ([Cowie et al.l 11996) . Most recently, based on first 
Herschel/PACS far-infrared data, even the opposite ef- 
fect, the s o-called SSFR-upsizing at z > 1.5, has been 
proposed ([Rodighiero et alJl2010ah FI 

More measurements are needed to understand the re- 
lation of SFR and stellar mass and its evolution with 
redshift. This holds especially true for the population of 
SF galaxies. An accurate measurement of the (S)SFR- 
sequence at all epochs is key for a better understanding 
of gala xy evolution. As it was claimed (e.g. INoeske et all 
I2007H ) a tight correlation of SFR and stellar mass disfa- 
vors star formation histories (SFHs) of individual normal 
galaxies that are mainly driven by stochastic processes, 
such as mergers. Quite contrarily it favors smooth SFHs 
in such a way that the SFH at any cosmic epoch of a 
galaxy is solely determined by its stellar mass content 
measured at the corresponding redshift unless the galaxy 
becomes subject to quenching of star formation. In this 
sense the SFR-M* relation at a given redshift is regarded 
an isochrone for galaxy evolution in the same manner the 
Hertzsprung-Russel-Diagram is an isochrone for the evo- 
lution of a stellar population at a given ageFI It sh ould 
be mentioned, however, that iCowie fc Bargerl (|2008l ) dis- 
agree with this conclusion which underlines the impor- 
tance of future studies that use a sufficiently deep direct 
SFR tracer to study the intrinsic dispersion of the SSFRs 


Several tracers across the electromagnetic spectrum 

16 This t r end i s weakly supported by the earlier findings of 
IQliver et all pOlCft . 

17 The (S)SFR-mass relation is therefore also someti mes referred 
to as 'the galaxy main sequence' (Nocskc et al. 2007b) that is, for 
an individual galaxy of stellar mass M*, connected by evolution- 
ary tr acks (e.g. the so-called tau-model discussed in Nocs ke~et al.l 
I2007aft at distinct cosmic epochs (see also INoeske 2009, for a sum- 
mary) . 

18 ICow ie & Barecr (2008) cannot confirm the lo w level of in- 
trinsic dispersion in the SSFR-M* plane found by INoeske et al.l 
(1200 7bT) and they discuss other hints they find supporting SFHs to 
be rather dominated by episodic bursts. We emphasize that the 
larger dispersion of SSF Rs might be caused by the relatively broad 
bins in redshift used by Cowie & Bargcr (2008) given the steep in- 
crease with redshift of SSFRs at z < 1.5 while studying all massive 
galaxies. 
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are used to estimate the star formation rate of a nor- 
mal galax\f ra l. While rest-frame ultraviolet (UV) light 
originates mainly from massive stars and thus directly 
traces young stellar populations it will be strongly atten- 
uated by dust. The absorbed UV emission is thermally 
reprocessed by heating the dust which in turn reemits 
at infrared (IR) wavelengths. Star formation also leads 
to emission in the radio continuum since charged cosmic 
particles are accelerated in shocks within the remnants 
of supernovae ( SNR) leadin g to non- thermal synchrotron 
radiation (e.g. iBelll Il978al and e.g. iMuxlow et al.l 11994 
for observations of individual SNRs). Thermal free- free 
emission (Bremsstrahlung) in general c ontributes only 
weakly to the 1.4 GHz signal (see e.g. iCondonl I1992T) 
but might become dominant in low-mass systems where 
the synchrotron emi ssion was e mpirically found to be 
strongly suppressed (jBelj [2003) . Also empirically the 
phenomenon of radio emission triggered by star forma- 
tion results in its well known strong c orrelation with the 
far-IR output of a given SF galaxy (e . g.lHelou et aill985t 
ICondonl [1991 lYun et al.l l200ll; iBell l2003|) that appears 
to persist ou t to high (z > 2) red shifts in a non-evolving 
fashion (e.g. ISargent et al1l2010al fbh. 

A major advantage of radio emission as a tracer for 
star formation is its obvious independence of any correc- 
tion for dust attenuation. Due to well known underlying 
physical processes the spectral energy distribution of a 
normal galaxy in the low (< 5) GHz regime shows a 
F v oc z/*" shape (e.g. [Bell [l978ah . While a TC = -0.8 is 
foun d to be a typical value fo r the radio spectral index 
(e.g . ICondonl IT992I: iBeS] 12003 } for a summ ary but also 
e.g. IScheuer fc Williamsl fl96a lBelll978bl for early re- 
sults) no further spectral features are expected in this 
frequency range thus leading to a robust K-correction 
up to high (z < 3) redshiftsH Both advantages di- 
rectly confront the rather uncertain dust attenuation co- 
efficient for UV light and the presence of polycyclic aro- 
matic hydrocarbon (PAH) emission features redshifted 
(at z > 0.8) into the 24 /im band commonly used as 
an estimator for the total infrared (TIR) emission. Also 
the combination of UV and mid-IR emission tracing star 
formation is limited since it is typically tested in mod- 
erately SF systems at low redshift (for a summary see 
iCalzetti k, Kennicuttl [2009) which might not resemble 
high redshift galaxies with higher SFRs and larger dust 
content. Finally, even at a resolution of ~ 5" achieved 
by current UV and IR telescopes blending of sources be- 
come s a severe issue for t he faint end of the sources (see, 
e.g., iZheng et al.ll2007bl ). Current radio interferometers 
such as the (E)VLA and (e)Merlin achieve resolutions of 
< 2" that are needed to unambiguously identify optical 
counterparts. This unambiguity is particularly impor- 
tant in a stacking experiment as otherwise flux density 
from nearby sources might contribute to the emission of 
an individual object. A drawback of using radio emission 
to trace star formation is the generally low sensitivity to 
the normal galaxy population even in the deepest radio 
surveys to-date which usually limits the analysis to a 



19 'Normal' galaxies are defined as systems that do not host an 
active galactic nucleus. 

20 Unless radiative losses, e.g. inverse Compton scattering 
against the cosmic microwave background, steepen the spectral 
index to values ~-1.3. 



stacking approach. Therefore, current radio surveys al- 
low one to study average SFR properties while they can- 
not shed light on the intrinsic dispersion of individual 
sources. This situation will improve with future EVLA 
surveys. 

Studying the stellar-mass dependence of the SFH re- 
quires a mass-complete sample in order to prevent in- 
ferred evolutionary trends from being mimicked by sam- 
ple incompleteness. Early type galaxies containing pre- 
dominantly older stellar populations and showing there- 
fore a prominent 4000 A break (see e.g. iGorgas et al.l 
1999) arc likely to be excluded in optical surveys above 
z <~ 1 even at deep limiting magnitudes as the break 
is redshifted into the selection band. Optical selec- 
tion, thus, potentially limits any study of a stellar mass- 
complete sample to the bright (i.e. high-mass) end or 
is effectively rather a selection by unobscured SFR than 
by stellar mass if the full sample is considered for the 
analysis. 

Channel 1 of the IRAC instrument onboard the Spitzer 
Space Telescope provides us with the 3.6 (im waveband 
that samples the rest-frame i^-band at z <~ 0.5 to the 
rest-frame z-band at z ~ 3. It is therefore ideal in prob- 
ing mainly the light from old low-mass stars while not be- 
ing severely affected by dust. For the analysis presented 
here, hence, a deep and rich (~ 100, 000 sources at z < 3) 
3.6 galaxy sample in combination with accurate pho- 
tomet ric redshifts and stellar-mass estimates has been 
used filbert et al.ll2010l) . With a sky coverage of 2 deg 
the Cosmic Evolution SurvejEH (COSMOS ) provides the 
larges t cosmological deep field to-date (see lScoville et al.l 
2007c], for an overview). The uniquely large COSMOS 
3.6 /jm galaxy sample offers uniform high-quality pan- 
chromatic data for all sources enabling us to study the 
SSFR in small bins in both stellar mass and redshift. Ad- 
ditionally the evolution of the stellar mass-functions has 
been studied alread y based on the sa me sample and its 



SF sub- population (Ilbert et al.l l20100. As it was argued 
(e.g. in lDaddi et al.ll2010al) the combination of the indi- 



vidual evolutions of the mass function and the (S)SFR- 
sequence might be the most important observational con- 
straints for understanding the stellar mass built-up on 
cosmic scales jointly resulting in a potentially peaking 
and declining SFRD. 

This paper is organized as follows. In Sec. [2]we present 
our principle and ancillary COSMOS data sets and the 
selection of our sample. Sec [3] contains a detailed de- 
scription of our stacking algorithm and the derivation of 
average SFRs from the 1.4 GHz image stacks. Additional 
methodological considerations pertaining to both sample 
selection and flux density estimation by image stacking 
are to be found in the Appendices. Readers who wish to 
directly proceed to our results and their interpretations 
can find those regarding the relation of SSFR and stellar 
mass in Sec. [H Our measurements of the CSFH and 
a simple model that reproduces these observations are 
discussed in Sec. [5l Both Sections ([4] and [5]) contain a 
detailed discussion of how our results relate to the recent 
literature. We summarize our findings in Sec. [6l 

Throughout this paper all observed magnitudes are 
given in the AB system. We assume a standard cosmol- 

21 http:/ /cosmos. astro. caltech.edu 
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ogy with H = 70 (km/s)/Mpc, M = 0. 3 and fi A = 0.7 
consi stent with the latest WMAP results (|Komatsu et al.l 
120091) as well as a radio spectral index of a TC — —0.8 in 
the nota tion given abov e if not explicitly stated other- 
wise. A IChabrierl (pOOl initial mass function (IMF) is 
used for all stellar mass and SFR calculations in this ar- 
ticle. Results from previous studies in the literature have 
been converted accordingly^ 

2. THE PAN-CHROMATIC COSMOS DATA USED 

In order to study the redshift evolution of galaxies in 
general, and the evolution of their SFRs in particular, a 
complete and large sample of normal galaxies is needed as 
it not only provides representative but also statistically 
significant insights. 

The large area of 2 deg 2 covered by the COS- 
MOS survey, fully imaged at optical wavelengths 
by th e Hubble space telescop e (HST) (jScoville et al.l 
I2007al IKoekemoer et al.l 120071 ). is necessary to mini- 
mize the effect of cos mic variance. Deep UV GALEX 
(jZamoiski et al. | l2007f) to ground-based optical and near- 
infrared (NIR) ([Taniguchi et al.|[2007l:lCapak et aT][2007h 
imaging of the equatorial fieldZl yielded accurate pho- 
tometric data products for ~ 1 x 10 6 galaxies down 
to 26.5th magnitu de in the i-band ([Ilbert et al.l 120091 : 
ICapak et al.l I2007h . Thanks to extensive spectroscopic 
efforts at optical wavelentghs using VLT/VTMOS and 
Magellan/IMACS (ILillv et all [20071: 1 Trump et all 120071) 
the estimation of photometric redshifts for all these 
sources could be accurately calibrated. Ongoing deep 
Keck/DEIMOS campaigns (Pis Scoville, Capak, Salvato, 
Sanders and Karteltepe) extent the spectroscopically ob- 
served wavelength regime to the NIR which is critical to 
improve the photometric calibration for faint sources at 
high redshifts. In addition to observations of the whole or 
parts of the COSMOS fi eld in the X-ray (Hasingc r et al. 
2007J lElvis et all 120091) and millimeter (jBertoldi et~ 
2007t IScott et al.ll2008[). imag ing by Spitzer in the mid- 
to far-IR (San ders et al.ll2007fl as well as interf erometric 
radio data ([Schinnerer et al.l 12004 120071 . 120101) covering 
the full 2 deg 2 have been obtained. 

2.1. VLA-COSMOS radio data 

Radio observations of the full (2 deg 2 ) COSMOS field 
were carried out with the Very Large Array (VLA) at 
1.4 GHz (20 cm) in several campaigns between 2004 
and 2006. T he entire field was obse rved in A- and C- 
configuration (jSchinnerer et al.ll2007D where the 23 indi- 
vidual pointings were arranged in a hexagonal pattern. 
Additional observations of the central seven pointings 
in th e more compact A-configuration (jSchinnerer et all 
2010) were obtained in order to achieve a higher 1.4 GHz 
sensitivity in the area overlappin g with the CO SMOS 
MAMBO millimeter observations ([Bertoldi et al.l 120071) . 
In both cases the data reduction was done using stan- 
dard procedures from the Astronomical Imaging Pro- 
cessmg System (AIPS) (see iSchinnerer et all 120071 for 

22 Lo garithmi c mass es and SFRs based on a Salpcter (J1955I) 
IMF, a IKroupal 1120011) IMF and a IBaldrv fc Glazebrookl mM) 
IMF are converted to the Chabrier scale by adding -0.24 dex, dex 
and 0.02 dex, respectively. 

23 The COSMOS field is centered at RA = 10 : 00 : 28.6 and 
Dec = +02 : 12 : 21.0 (J2000) 



details). At a resolution of 1.5" x 1.4" the final map has 
a mean rms of ~ 8 /zJy/beam in the central 30' x 30' and 
~ 12 /xjy/beam over the full area, respectively. Using 
the SAD algorithm within AIPS, a total of 2,865 sources 
were identified at more than 5cr significance in the final 
VLA-COSMOS mosaic ([Schinnerer et al.ll2010h . As the 
outermost parts of the map are not covered by multiple 
pointings the noise increases rapidly towards the edges. 
In this study we therefore exclude these peripheral re- 
gions resulting in a final useable area of 1.72 deg 2 . 

2.2. A 3.6 fim selected galaxy sample within the 
COSMOS photometric (redshift) catalogs 

Deep Spitzer IRAC data mapping the entire COS- 
MOS field in all four channels hav e been obtained dur - 
ing the S-COSMOS observations (|Sanders et all 120071) . 
The data reduction yielding images and associated un- 
certainty maps fo r all the four channels is described in 
lllbert et all ([20101) (110 hereafter). For the 3.6 fim chan- 
nel a source catalog has been obtained by O. Ilbert and 
M. Salvato ( private communication) using the SExtrac- 
tor package ([Bertin fc Arnoutsl 119961) . Given the point 
spread function (PSF) of 1.7" a Mexican hat filtering of 
the 3.6 fim image within SExtractor was used in order 
to assure careful deblending of the sources. 

The resulting sample of 3.6 fim sources down to a lim- 
iting magnitude of toab(3.6 fim) = 23.9 in the 2.3 deg 2 
field, not considering the masked areas around bright 
sources (K s < 12), areas of poor image quality and the 
field boundaries, consists of 306,000 sources i 2 ^ 

As detailed in 110 photometric redshifts (hereafter 
photo-z's) were assigned to all 3.6 fim detected sources. 
The vast majority of sources is also detected at opti- 
cal wavelengths an d therefore contain ed in the COSMOS 
photo-z catalog^ (Ilbe rt et all [2009) so that in general 
photometric information from 31 narrow-, intermediate 
and broad-band FUV-to-mid-IR filterbands was avail- 
ableE3 Within the remaining 4 % (i.e. a total of 8507) of 
the 3.6 fim sources 2714 are also cont ained in the COS- 
MOS if— band selected galaxy sample (jMcCracken et all 
I2010D and are also regarded as real sources. 110 assigned 
photo-z's to these extremely faint objects using the avail- 
able NIR-to-IRAC photometry. 

The quality of the photo-z's was estimated (for details 
see 110) by using spectroscopic redshifts for a total of 
4,148 s ources at mA B (i +) < 22.5 from the zCOSMOS 
survey ([Lilly et all 120091 ). At a rate of < 1 % of outliers 
the accuracy was found to be o-f Zpbot _ ZBpec yn +ZBpec ^ = 
0.0075 down to the magnitude limit of the spectroscopic 
sample. For all objects within the 3.6 fim selected cat- 
alog - regardless of i-band magnitude the accuracy was 
derived by using the la uncertainty on the photo-z's from 

24 As a stacking analysis depends on the input sample prior 
masked areas consequently reduce further the effective area for 
this study. All space densities reported in this work are therefore 
computed for an effective field size of 1.49 deg 2 . 

25 This optically deep sample has a limiting magn i tude of 26.2 
in the i+ selection band (see Tab. 1 in Salvato ct al. (2009)). 

26 As described in detail by 110 all photo-z's used in our study 
were obtained using a y 2 template-fitting procedure impleme nted 
in the code he Phare HArnouts et al.H2002t lllbert et aTll 20061) and 
a library of 21 templates. Additional stellar templates were used 
to reject stars (i.e. sources with a lower \ 2 values for the stellar 
compared to the galaxy templates) from the final galaxy sample. 
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the probability distribution function which yields a con- 
se rvative estim a te of t he photo-z uncertainty as detailed 
in lllbert et all (|2009ft . At 1.25 < z < 2 the relative 
photo-z uncertainty is 0.08 and thus higher by a fac- 
tor of four compared to the median value for the full 
(771ab(3.6 /jm) > 23.9) sampleFI We account for this 
when binning the data in redshift by choosing increasing 
bin widths with increasing redshift(3 It is worth not- 
ing that the photo-z accuracy is degraded at magnitudes 
fainte r than m^fi" 1 ") = 25.5 (See Fig. 12 in lllbert et al.l 
(2009)). Our choice of lower stellar mass limits (see Sec. 
12. 6[) and our stellar mass binning-scheme (see Sec. 12. 6|) 
automatically ensures a low fraction (< 15 %) of these 
optically very faint objects within the lowest mass-bin 
above our mass limit at any redshift. The fraction of such 
faint objects effectively vanishes towards higher masses 
as also pointed out by HOP 9 ! 

2.3. Estimation of stellar masses 

Stellar masses for all objects within the 3.6 fim selected 
parent sample have been computed by 110. Here, we 
briefly summarize the method and the important find- 
ings. For the estimation of stellar masses based on a 
Chabrier IMF stellar population sy nthesis models gen- 
erated with the package provided by lBruzual fc Charlotl 
(2003) (BC03) have been us ed. Furthermo r e an expo- 
nentially declining SFH and a lCalzetti et al.l (2000) dust 
extinction law have been assumed. Spitz er MIPS 24^m 
flux densities (from iLeFloc'h et al.l 120091 ) have been in- 
cluded in the SED template fitting as an additional con- 
straint on the stellar mass. Systematic uncertainties on 
the stellar masses, caused by the use of photo-z's, the 
choice of the dust extinction law and library of stellar 
population synthesis models, have been investigated. No 
systematic effect due to the use of photo-z's is appar- 
ent. Stellar masses derived from the BC03 templates are 
systematically higher by 0.13-0.15 dex co mpared to the 
newer Chariot & Bruzual (2007) versions (|Bruzualll2007|) 
that have an improved treatment of thermally pulsing 
asymptotical giant branch (TP-AGB) stars. As BC03 
models are commonly used in the literature, both stud- 
ies, 110 and this work, are based on BC03 mass estimates. 

2.4. Spectral classification 

A number of studies suggest the existence of a bi- 
modality in the SSFR-M^ plane (e.g. iSalim et al.l 120071; 
Elbaz et al.l l2007t iSantini et all l200alRodighiero et al] 
2010al) leading to a tight SSFR-sequence to be in place 
only for SF galaxies. Therefore a deselection of quiescent, 
i.e. non SF, objects is needed. 

Following 110 we classify galaxies with a best-fit BC03 
template that has an intrinsic (i.e. dust unextincted) 

27 For a color-selected sub-set of galaxies for which spectroscopic 
redshifts from the zCOSMOS-faint survey (Lilly et al., in prep.) 
were available the photo-z accuracy was directly tested at 1.5 < 
z < 3. This yields an accuracy of &Az/(i+z) = 0.04 with 10% of 
catastrophic failures. 

28 It should be mentioned, how ever, that the projected-pair anal- 
ysis by Ouadri & Williams ( 2009 ) independently shows that photo- 
z's from data sets with broad- and intermediate band photometry 
like the COSMOS catalog are not expected to have very different 
photo-z errors at z > 1.5 than at lower redshifts. 

29 110 use comparable mass limits and their Fig. 8 the strong 
decline of the fraction of optically faint objects with mass at all z. 



rest-frame color redder than (NU V — r + ) te mp = 3.5 
as quiescent. Sever a l authors (e.g. Wvder et al.l 120071 : 
iMartin et all l2007bl: lArnouts et ail 120071 ) suggest this 



color to be an excellent indicator for the recent over past 
average SFR as it directly traces the ratio of young (light- 
weighted average age of ~ 10 s yr) and old (> 10 9 yr) stel- 
lar populations. Seeking for a color bimodality that dis- 
criminates galaxies with currently high from those with 
low star formation activity the NUV — r color appears 
therefore to be superior to purely optic al rest-frame col- 
ors such as U - V (e.g. iBell et al.ll200l . 

Using a dust uncorrected NUV — r + versus r+ — J 
rest-frame color-color diagraixFl 110 showed that in the 
range < z < 2 for (NUV — r + ) tC mp > 3.5 quiescent 
galaxies are well separated from the parent sample with- 
out severe contamination by dust-obscured SF galaxies. 
This quiescent p opulation is therefore comparable to the 
one classified by I Williams et al"] (2009) based on a U — V 
versus V — J rest-frame color-color diagram. 

Furthermore our quiescent population shows a clear 
separation from the parent sample with respect to galaxy 
morphology. 110 visually classified a subset of 1,500 iso- 
lated and bright galaxies from the 3.6 /im parent sample 
using HST/ACS images and found the quiescent popu- 
lation among those to be clearly dominated by elliptical 
(E/S0) systems. A further cut ((NUV - r+) tcmp < 1.2) 
was shown to efficiently separate late type spiral and ir- 
regular galaxies from early type spirals as well as the 
remaining tail of elliptical systems. As any such color 
cut effectively is a cut in star formation activity we dis- 
cuss the spectral pre-classification of SF systems in more 
detail in Appendix [C] 

2.5. AGN contamination 

A major concern arising in the context of using radio 
emission to trace star formation is contaminating flux 
from active galactic nuclei (AGN) . For some galaxies the 
total radio signal might even be dominated by an AGN. 
For our study, ideally, we should therefore remove all 
galaxies hosting an AGN from our sample. 

Cross-matchin g the most recent XMM-COSMOS 
photo-z catalog (jSalvato et "all 120091 : iBrusa et all 120101 ) 
with the 3.6 /im selected parent sample delivered a to- 
tal of 1,711 (i.e. ~ 1 %) X-ray detected objects. Most 
of these sources exhibit best- fit composite AGN/galaxy 
SEDtEl while a minor fraction is well fitted by an SED 
showing no AGN contribution. However, here all X-ray 
detections are treated as potential AGN contaminants 
and thus removed from our sampleF^l 

Studies of the radio luminosity function (e.g. 
ISadler et al]l2"002T : ICondon et al.ll200l agree that radio- 

30 Here the absolute magnitudes were inferred from the observed 
magnitudes not accounting for dust reddening. 

31 Based on the Salvato et al. (2009) classification that uses an 
enhanced set of AGN/galaxy templates in order to fit the FUV- 
to-mid-IR SED and that includes further priors (e.g. variability 
information) in the fitting procedure while delivering accurate pho- 
tometric redshifts for all these sources. 

32 Note that IHickox et all (t2009t) and IGriffith fc Stern] d201Qh 
yield strong evidence that X-ray and radio selected AGN are mu- 
tually distinct populations such that it is actually questionable to 
remove X-ray selected objects from our samples. We confirmed 
that our results do not change significantly when including those 
objects and urge caution to remove more objects if deeper X-ray 
data compared to the XMM imaging used here is at hand. 
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AGN contribute half of the radio light in the local uni- 
verse at radio luminosities slightly below L\a GHz ~ 
10 23 W/Hz and outnumber SF galaxies above ~ 
2 x 10 23 W/Hz. Detailed multi-wavelen gth studies 
(|Hickox et all [200l IGriffith fc Sterol [2010^ yield that 
radio-AGN are hosted by red galaxies. The evolution 
out to z ~ 1.3 of the radio-AGN fraction for luminous 
(i.e. L1.4 ghz > 4 x 10 23 W/Hz) radio-A GN as a func- 
tion of stellar mass has been presented bv ISmolcic et al.1 
( 2009b]) who selected a parent sample of red galaxies with 
rest-frame U — B colors in a range close to our quiescent 
galaxy fraction. The derived AGN-fractions at a given 
stellar mass within the red galaxy population are there- 
fore applicable to our sample . 

According to ISmolcic et all (|2009bD (see their Fig. 11) 
the luminous radio-AGN fraction at0.7<z<1.3is well 
below 25 % at all log (M*[M ]) < 11.5 where it drops 
quickly to ~ 1 % at log (M*[Af Q |) = 11 and continuously 
to lower levels as stellar mass decreases. At masses lower 
than log (M, [M Q ]) = 11 the radio-AGN fractions are 
subject to non-negligible evolution between < z < 1 
while the fractions at higher masses increase only mildly. 
However, given that the radio-AGN fractions are well be- 
low 1 % in the former (i.e. low) mass range out to z ~ 1.3 
it is unlikely that they rise above 10 % at z 3> 1. The 
evolution of the radio-AGN fraction at the high- mass end 
is much slower but the fractions are high already in the 
local universe. We therefore set an arbitrary but reason- 
able threshold and exclude all quiescent objects above 
log (M*[M©]) = 11.6, where the expected radio-AGN 
fraction exceeds 50 %, from our stacking analysis. As 
the radio-AGN fraction sharply drops below this limit 
the remainder of our full galaxy sample should be gen- 
erally free from radio-AGN contamination. Within the 
highest mass bin probed here (M* > 10 11 M Q ; see Fig. 
[2]), however, the average fraction of radio AGN among 
the quiescent galaxies could still be ~ 25 % at z > 1. 
This fraction appears high but among the entire galaxy 
population (quiescent and SF sources) the percentage 
drops to at most 10 % within our highest mass-bin at 
z 1. As shown by 110 globally, but in particular at 
M* > 10 11 Mq the fraction of quiescent galaxies among 
the entire samp le decreases strong ly towards higher red- 
shifts (see also iTavlor etal] 1200 9) Fl An upper bound 
of 10 % to the potential fraction of radio-AGN within 
our highest mass-bin hence is a well justified number at 
z > 1. 

Due to prominent spectral features we regard the SED- 
fits for quiescent objects as most trustworthy such that 
also the SED-derived SFRs are expected to be accu- 
rate for individual objects. These SFRs therefore serve 
as a prior for revealing potential radio-AGN among the 
radio-detections in our sample. Hence we correlated our 
sample with the latest version of the VLA-COSMOS 
catalog ([Schinnerer et al.1 l2010T l and excluded those ob- 
jects showing radio-derived SFRs more than twice as 
large as the SED-derived values. We find that the over- 
all number of objects excluded in each sample to be 
stacked is negligible. The same holds for very luminous 
{L\a GHz > 10 25 W/Hz) radio sources among the radio- 
detections that are most likely high-power radio-AGN. 

33 The global stellar mass density of quiescent galaxies at z = 1.5 
is about an order of magnitude lower than the SF one. 



We therefore excluded also these objects relying on indi- 
vidual photo-z's in order to estimate the radio luminos- 
ity. The total fraction of galaxies among all objects in a 
given bin that we exclude by these two criteria amounts 
- on average - to less than 0.3 % such that only a frac- 
tion of radio detections is rejected. We stress the small- 
ness of this percentage as the advantage of our radio- 
approach is its insensitivity to dust obscuration which 
might be challenged by relying on individual optical best- 
fit SEDs as we partially do when removing some of the 
radio-detected objects. It should be noted that the high- 
power radio-AGN candidates are exclusively hosted by 
red galaxies within our sample. Hence, X-ray detected 
sources are the only objects that have been removed from 
our SF samples. 

As the radio-based SFR-results presented in this paper 
(see Sec. EJ are based on a median stacking approach (see 
Sec. [3]) a minor fraction of contaminating outliers such 
as AGN is even tolerable. We conclude that contamina- 
tion of the stacked radio flux densities caused by AGN 
emission at radio frequencies is not a siginifcant source of 
uncertainty in the context of this study and that our con- 
clusions would not change if we included the radio-AGN 
candidates in our analysis. 

2.6. Completeness considerations 

In the following we will discuss the completeness of 
our (sub-) samples. It is important to distinguish be- 
tween two kinds of effects. While the full 3.6 /im-selected 
source catalog (1) is subject to a flux density-dependent 
level of detection incompleteness we are interested in (2) 
how representative for the underlying population a given 
subset of galaxies is at a given mass. Our lower mass 
limits hence need to be chosen such that the objects at 
hand remain sufficiently representative. 

110 evaluated the efficiency of the source extraction 
procedure (and hence the detection completeness) with 
Monte Carlo simulations of mock point-sources inserted 
into the 3.6 /im mosaic. At the flux density cut of 
1 /iJy (toab(3.6 /im) = 23.9) the catalog was found 
to be 55 % complete; 90 % completeness is reached at 
F3.6 fim ~ 5 /iJy (mAB(3.6 /im) = 22.15). This rather 
shallow decline in detection completeness towards the 
magnitude limit is due to source confusion. 

Fig. [T] shows the distribution of 3.6 /im flux density 
with stellar mass in narrow redshift slices for our source 
catalog, color coded by the spectral type of the galaxies 
(see Sec. 12. 4[) . The Monte Carlo detection completeness 
levels of the catalog are indicated by horizontal dashed 
black lines starting from the flux density limit at the bot- 
tom to the 95 % completeness limit at the top in each 
panel. Each sub-population shows a clear correlation be- 
tween 3.6 /im flux density and stellar mass, and the qui- 
escent population residing at the high-mass end at all 
flux densities. While SF sources (the union of all blue 
and green data points) span the entire range of 3.6 /im 
flux densities at all redshifts, hardly any quiescent ob- 
jects with low flux densities are observed at intermediate 
and high redshifts. We consequently find fewer and fewer 
low-mass quiescent objects as redshift increases. This is 
certainly the combined effect of a general absence of such 
sources at higher redshifts plus the loss of these objects 
at low flux densities due to the global detection incom- 
pleteness of our catalog. 



The star formation history of mass-selected galaxies in the COSMOS held 
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Figure 1. Observed 3.6 fim flux versus stellar mass from SED fits. The 12 panels show photometric redshift bins, as 0.2 < 2 p hot < 3 
indicated in the upper left part of each panel. Flux densities relate to AB-magnitudcs via toab(3.6 ^m) = — 2.5 log 10 (F3 g Mm [jitJy]) + 23.9 
(23.9 is the magnitude limit of the catalog). Blue points denote highly active SF systems, with intrinsic rest-frame template colors 
(NUV — r + )temp < 1.2; red points denote quiescent (low star formation activity) galaxies with (NUV — r + )t C mp > 3.5. Green points 
are objects of intermediate intrinsic rest-frame color (and hence star formation activity). Horizo ntal dashed lines mark the levels of the 
detection completeness, estimated through Monte Carlo simulations of artificial sources (see Sec. 12. 61 1. The vertical dashcd-dotted line in 
each panel denotes the lower mass limit, to which the sample of SF systems (i.e. the union of all blue and green points) is representative 
of the underlying SF population and the SFR is not affected by the intrinsic catalog incompleteness. The solid vertical line in each panel 
denotes the mass limit to which the entire sample is regarded as representative. 



Detection incompleteness affects all sources in at a 
given 3.6 \xm flux density, regardless of their spectral 
type. However, the different distribution of quiescent and 
SF sources with respect to 3.6 fim flux density necessi- 
tates that a different lower mass limit ('representative- 
ness limit', hereafter) be adopted, depending on whether 
we consider the redshift evolution of SF galaxies or that 
of the entire galaxy population. We now discuss how the 
limiting mass is set for these two samples: 

• In the case of the entire galaxy population, it is im- 
portant to be working with a sample in which the 
fractional contribution of quiescent and SF sources 
reflects the true population fractions as closely as 
possible. The probability that this is the case be- 
comes larger, the better the underlying popula- 
tion is sampled; i.e. it rises with increasing de- 
tection completeness. We therefore require an in- 
trinsic catalog completeness of 90 % (correspond- 
ing mAB(3.6 /rat) = 22.15) at all masses considered. 
This is an arbitrary but reasonable threshold as the 
intrinsic catalog completeness rises rapidly towards 
higher flux densities. 

In order to evaluate the actual mass representa- 
tiveness limit we need to define yet another type of 



completeness level, which we shall refer to as sta- 
tistical completeness. By applying the analytical 
scheme described in detail in Appendix [A] we en- 
sure that the statistical completeness of our sample 
always reaches at least 95 %. This value sets the 
actual level of representativeness of a given sub- 
sample. In the following we will also present re- 
sults for sub-samples below the evaluated mass- 
limits which will be indicated separately. Those 
results represent strict upper limits in (S)SFR. 

• For studying the SF population we need not be as 
conservative because we are dealing with a single 
sub-population that is subject to less internal varia- 
tion of SF activity as a (bimodal) sample including 
both quiescent and SF systems. We thus consider 
sources down to the limiting flux density of the 
3.6 /im catalog when we compute the mass limits 
at a given redshift. Since this implies that at low 
stellar masses the flux distribution is sharply cut 
due to the magnitude limit of our catalog, we still 
need to use the scheme presented in Appendix [A] 
to identify stellar mass limits that provide a repre- 
sentative flux density distribution for SF galaxies. 
As visible in all panels of Fig. [1] the lowest mass 
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Table 1 

Stellar mass limits for all/SF galaxies 



z 


All galaxies 

iOg(M t [iMQjJlim 


SF systems 

iog(A-j, [-Mqj; 


0.3 


9.7 


8.8 


0.5 


9.8 


8.9 


0.7 


10.0 


9.1 


0.9 


10.1 


9.1 


1.1 


10.2 


9.3 


1.3 


10.4 


9.1 


1.5 


10.4 


9.5 


1.7 


10.5 


9.6 


1.9 


10.8 


9.7 


2.1 


10.8 


9.8 


2.3 


10.8 


9.9 


2.5 


10.9 


9.9 


2.7 


11.0 


10.1 


2.9 


11.1 


10.2 



Note. — The lower stellar mass limits 
above which our samples are regarded rep- 
resentative. Those limits are as shown in 
Fig. m and [T2l and have been derived based 
on the scheme that is detailed in Appendix 

m 

bin always contains objects over the full range of 
detection completeness, from 55 % to 100%. One 
might expect - and the SED fits confirm this - 
that among galaxies of a given mass, those with the 
fainter fluxes have lower SSFRs. Failure to include 
them (due to detection incompleteness) would thus 
yield average radio-derived SSFRs that are biased 
towards higher values. We wish to emphasize, how- 
ever, that our choice of the statistical completeness 
level ensures that this bias is small above our mass 
limit and that our samples hence are 'representa- 
tive' in the sense that they can be expected to ren- 
der a meaningful measurement of, e.g., the average 
SSFR of the underlying population. 

The stellar mass representativeness limits for the whole 
sample and the SF systems are marked in Fig. Q]as verti- 
cal lines for each redshift bin in the range 0.2 < z p hot < 3 
and listed in Tab. [TJ Note that they increase with 
redshift. As a consequence, our results will be based 
on fewer mass bins at high redshift and the aforemen- 
tioned bias in the lowest mass bin may therefore have 
a larger impact on fitting trends. Very conservatively 
speaking, our results for SF objects presented in the fol- 
lowing should generally be regarded as most robust at 
z < 1.5 while evolutionary trends inferred at the high 
mass end are robust out to our redshift limit of z — 3. 
We will also show results for SF galaxies obtained at 
masses lower than the individual mass limits and treat 
them as not entirely representative. Such measurements 
will be indicated with different symbols in our plots and 
we will discuss any further implications in Sec. 14.41 

The final sample of galaxies with mAB(3.6 /im) = 23.9 
and Zphot < 3 consists of 165,213 sources over an effective 
area of ~ 1.5 deg 2 . Fig. 3 in 110 shows the redshift 
distribution with a median of Zphot ~ 1.1. After adopting 
a lower redshift limit of z p hot = 0.2 in order to account 
for the small local volume sampled by our effective area 
and our binning scheme 113,610 sourceiQ (90,957 SF 

34 This number already considers the upper limiting mass for 



galaxies) enter our analysis. This is by far the largest 
galaxy sample used for studying the dependence between 
SFR and stellar mass throughout cosmic time. Fig. [2] 
shows the adopted binning scheme and the number of 
galaxies contained in each stellar mass and photo-z bin. 

3. METHOD AND IMPLEMENTATION OF RADIO IMAGE 
STACKING 

The bulk of objects in our 3.6 /Ltm selected sam- 
ple is not individually detected in the 1.4 GHz contin- 
uum. An estimation of the SFR based on the radio 
flux density for every object in the sample is therefore 
impossible. On the other hand, studying only radio- 
detected galaxies in this sample yields effectively a se- 
lection by SFR and not by stellar mass since only radio- 
bright, i.e. highly active star forming, normal galax- 
ies remainJ3 By co-adding postage stamp cutout im- 
ages of the 1.4 GHz map at the positions of sources in 
the sample it is possible to estimate the typical radio 
properties for a specific galaxy population. Usually re- 
ferred to as stacking, this technique has proven to be 
a powerful tool to estimate the typical flux density of 
gala xies with a given property, not only in the radio 
e.g. iWhite et al.l 12007^ iCarilli et all 120081: iDunne et all 
Pannella et all l2009t iGarn fc Alexander! I200S ' 

12010ft but also in the 



2009; 



Bou rne et all 



mid-IR (e.g. 



2010: Mcssias et al 
Zheng et all 120061 



as 



2007al lbl: iMartin et al l 
2007al: iBourne et al.ll2010h. far-IR (e.s. Ilee et all 120101: 
Rodighiero et al.ll2010al: IBourne et al l 12010ft as well as 
sub-m m (e.g. iGreve et a,l.ll2009t iMartinez-Sansigre et all 
2009). The list can be extended to other wavebands al- 
ways requiring a galaxy sample representative for the 
underlying population. 

3.1. Median stacking and error estimates 

Our stacking algorithm uses cutouts with sizes of 
40" x 40", centered on the position of the optical coun- 
terpart. Since the COSMOS astrometric reference sys- 
tem was provided by the VLA-COSMOS observations 
the positional accuracy between radio and optical sources 
should b e well within the errors o f both datasets. As de- 
tailed in iSchinnerer et all (|2007l ) the relative and abso- 
lute astrometry of the VLA data are 130 and < 55 mas 
respectively. In other words the average distribution of 
radio flux follows the one at optical wavelengths and 
the central pixel in any stacked image was always the 
brightest one. Averaging over pixels located at the same 
position in each stamp hence is an astrometrically well- 
defined problem. 

It can be approached by computing either the mean 
or the median of the mentioned set of pixels. The re- 
sulting stamp then shows the spatial distribution of the 
average radio emission for the sample studied. For an 
input sample of N galaxies its background noise level 
should correspond to ~ l/\/N of the noise measured 
in a single radio stamp Any sample of galaxies in a 

quiescent galaxies as discussed in Sec. 12.51 and excludes further 
328 sources (i.e. 0.3 %) classified as radio-AGN. 

35 The currently deepest radio surveys (e.g. [Owen & Morrison! 
2008, with rmsx 4 ghz ~ 3 fiJy) individually detect galaxies with 
SFRs > 50 M /yr at a redshift of Z = 1. 

36 Our image stacking implementation automatically monitors 
the decrease of the background noise level. For all results presented 
here it was verified that this decrease follows a ~ l/<\/N law. 
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Figure 2. Binning scheme in stellar mass and photometric redshift for the entire (left) and the SF (right) sample. Hatched bins lie below 
the corresponding limits denoted in Fig. [T]and are hence regarded representative of the underlying galaxy population. The top number 
in each box is the total number of galaxies used in the radio stack; the bottom number shows the signal to noise ratio achieved in the 
radio stack. In the left panel, the middle number is the amount of potential radio-AGN (not detected in the X-ray) that has been excluded 
from the stack. In the right panel this number gives the amount of optically very faint sources only detected redwards from the K-band. 
No radio-AGN candidate has been found among the radio-detected sources in the SF sample and only X-ray detected objects have been 
removed. The total number of galaxies per redshift bin is given below the panels. 

given bin of redshift and stellar mass likely contains also 
a fraction of sources with radio detections. Even if this 
fraction is small, the mean is sensitive to the large excess 
in 1.4 GHz flux density compared to the average radio 
emission of the individual non-detections. On the other 
hand, setting a threshold and excluding radio detections 
from the stack artificially changes the sample and the 
results, hence, depend on the threshold applied. 

In addition, foreground objects and other extended ra- 
dio bright features (e.g. lobes from radio galaxies) need 
to be handled with care and might be a source of contam- 
ination affecting the noise in the final stamp but also po- 
tentially the signal itself. It is therefore beneficial to ex- 
clude stamps showing these features from a mean stack. 
Typically, significantly less than 1 % of objects in a sam- 
ple are rejected and the effect of this artificial cut on 
the sample thus is negligible. However, by resorting to 
the median, the stacking technique becomes more ro- 
bust a gain st outliers allowing the use of the entire input 
samplejfj- Non-uniform noise properties within the ra- 
dio map can also be addressed by applying a weighted 
scheme to compute the median (see Appendix |B|) . While 
it is often argued that there is no straight-forward way of 
interpreting the sample median compared to the sample 



37 We applied the different stacking techniques discussed above 
to some of our sub-samples. We found the median flux densities 
obtained to be within < 7 % of those obtained when using a mean 
stacking technique that excludes radio-stamps including extended 
foreground features. For the mean stack we co-added objects in 
a given sub-sample that are not individually detected in the radio 
imaging and the flux density of the detected sources has been added 
to the flux density obtained from the stack in a noise-weighted 
fashion. This ensures that those objects that are not individually 
radio-detected - i.e. the bulk of our sources - are most strongly 
weighted 



mean, iWhite et al.l (|2007|) showed that the median is a 
well-defined estimator of the mean of the underlying pop- 
ulation in the presence of a dominant noise background. 

Although, strictly speaking, these arguments only ap- 
ply to the case of pure point sources, the condition of a 
dominant noise background is given in our study. One 
has to be aware of the fact that there is in principle 
no possibility to access the intrinsic distribution of ra- 
dio peak fluxes of the underlying population as a whole. 
The observed distribution merely is the intrinsic one as 
smeared out by the gaussian noise background. How- 
ever, it still contains information that needs to be used 
in order to find proper confidence limits for any statistic 
applied. Based on the above arguments, we expect the 
broadened distribution to be not only shifted but also 
skewed towards positive flux density values. As a re- 
sult, the uncertainty for the obtained peak flux density 
is poorly estimated by the background noise in the final 
stamp. Using a bootstrapping technique (see Appendix 
IB.2[) allows us to obtain more realistic, asymmetric error 
bars for our measured peak flux densities. 

3.2. Integrated flux densities, luminosities and SFRs 
from stacked radio images 

So far we considered only the average peak flux density 
which, to first order, would not require to stack individ- 
ual cutouts but only their central pixel. However, the 
typical galaxy of a given sample might exhibit extended 
radio emission. In that case the peak flux density is no 
longer equivalent to the total source flux but underesti- 
mates the typical radio flux density and hence all other 
quantities derived from it. 

The effect of bandwidth smearing (BWS), chromatic 
aberration caused by the finite bandwidth used during 
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Figure 3. Exam ples for 40" X 40" (i.e. (115 X 115) pixels) 1.4 GHz postage stamp images obtained via median stacking of star forming 
galaxies (see Sec. 12.411 in the redshift bin between 1.6 < z < 2. The number of galaxies for which individual radio cutout images from the 
VLA-COSMOS map (resolution of 1.5" X 1.4") have been co-added is given at the upper left of each stamp in the top row while the number 
at the lower right denotes the bin extent in log(M* [M©]). Due to the high signal- to- noise ratio s (S NRs) achieved, generally, a clear (dirty) 
beam pattern is visible. The bottom row shows the corresponding CLEANed stamps (see Sec. 13 21 for details). Contour levels are at 2, 4, 
5 ffbg an d followed by steps of 5 cr^ s (The individual SNRs are given in Fig. [2] and flux densities measured as well as the background noise 
levels reached are listed in Tab. [3j- 



the VLA-COSMOS observations leads to a spatial broad- 
ening ol a source even il it is intrinsically point-like. 
Within a single pointing the BWS increases with increas- 
ing radial distance from the pointing center and the effec t 
is analytically well determined (e.g. iBondi et alj [2008). 
For a mosaic like the VLA-COSMOS map that consists 
of many overlapping pointings the effect becomes ana- 
lytically unpredictable due to the varying uncertainties 
introduced by the calibration and observing conditions. 

For all our samples we constructed median co-added 
cutout images (Fig. [3]) and determined accurate RMS- 
noise estimates (hereafter attack) f° r the image stacks as 
described in Sec. 13.11 These (115 x 115) pixel 2 dirty 
maps were processed within AIPSF 5 ! 

We used the task PADIM to make the stacked images 
equal in size to a (512 x 512) pixel 2 image of the VLA- 
COSMOS synthesized (dirty) beam by filling the outer 
image frame with additional pixels of constant value The 
task APCLN with a circular CLEAN box of radius of 
seven pixels (i.e. 2.45") around the central component 
was then used to CLEAN each dirty map down to a flux 
density threshold of 2.5 x astaclB 

Integrated flux densities, as well as source dimensions 
and position angles after deconvolution with the CLEAN 
beam were obtained by fitting a single-component Gaus- 
sian elliptical model to the CLEAN image within a 
quadratic box of (15x 15) pixel 2 around the central pixel 
using the task JMFIT. Errors on the integrated flux den- 

38 Note that only bright (> 45 fiJy) radio sources have been 
CLEANed in the individual pointings prior to the assembly of the 
final mosaic. Hence, a stack of fainter sources will display a clear 
beam pattern as seen in Fig. [3] which must be deconvolved. 

39 This is a conservative threshold. We confirmed that this 
choice does not lead to systematic biases by CLEANing individual 
stacked images down to 1 X crgt^^. Integrated flux densities ob- 
tained from both approaches do not differ by more than 3 % and 
do not lead to mass-dependent effects. The mentioned fluctuations 
are well within the error margins. 



sities have been estimated according to Hopki ns et al.l 
( 2003) and rely on the combined information on the best- 
fit source model and the bootstrapping results from the 
image stacking: 



& Total 
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where (IWindhorst et al.|[l98llCondonlll997l and also the 
explanations in ISchinnerer et al. 
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1.5" is the major axis and 6* m = 1.4" the minor 
axis of the beam while 9m and 9 m are the major and mi- 
nor axis of the measured (hence convolved) flux density 
distribution. In order to include the bootstrapping error 
estimates we set S/N — (-F P eak)/cbs, i-e. the ratio of 
the peak flux density in the stacked dirty map and the 
68 % confidence interval resulting from the bootstrap- 
ping. The same applies to the parameter-dependent es- 
timators of the fit entering equation ([3]) that are given 
by: 



CM Cm 

4c B Cb 



1 + ir 



(J) 



and a = b = 1.5 for pp, a = 2.5 and b = 0.5 for pm as 
well as a = 0.5 and b — 2.5 for p m . 

For a given sub-sample centered at a given median red- 
shift ( 

Zphot) the average (median stacking based) inte- 
grated flux density (i<Total) observed at 1.4 GHz can be 
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Figure 4. Ratio of integrated to peak flux density at 1.4 GHz 
of the stacked radio images for different sample sub-sets. The left 
panel shows results for the entire sample, the right panel the sub-set 
of blue (SF) galaxies. The data is color-coded by redshift and the 
dashed black line depict s the uniform corre ction factor used in the 
radio stacking study by Dunne ct al. (2009). It is evident that the 
extent of radio emission is not uniform across our samples because 
our radio imaging has a higher angular resolution compared to the 
VLA map used by Dunne ct al. (2009). All measured data points 
arc listed in Tab. |2]and[U 

directly converted into a rest- frame 1.4 GHz luminosity 
using a K-correction that depe nds on the rad io spectral 
index a rc (here a rc = —0.8, e.g. [Condon 1993): 

(Li A GHz) [W/Hz] =9.52 x 10 12 (F To tai) [/xJy] 

X (d l [Mpc]^) 47r^l + ( Zph ot)) (5) 

with Dl the luminosity distance at this median photo-z 

of all objects inside the bin. 

It was pointed out bv lDunne et al.l (|2009D that the me- 
dian redshift might not be appropriate for estimating the 
radio luminosity if the peak of the radio flux density dis- 
tribution does not coincide with the median of the photo- 
z distribution. They overcame this problem by deriving 
(and subsequently effectively stacking) luminosities ac- 
cording to Eq. ([3]) for all objects relying both on the indi- 
vidual photo-z's and peak flux density measurements at 
the pixel corresponding to the position in the input cat- 
alog. At z > 0.2 they afterwards applied a common (i.e. 
redshift independent) factor to the median of all obtained 
luminosities to correct for the difference between peak 
and total flux density as well as for the effe ct of BWS. A 
simila r approach was rec ently also used bvlBourne et al.l 
l|2010f) . The method by iDunne et~aH (|2009|) is justified 
given their data as they find for z > 0.2 that the ratio of 
total to peak flux density does not change significantly, in 
particular not as a function of K-band magnitude. How- 
ever, our data does not yield such a uniform behavior 
with respect to mass in the correction factor as Fig. H] 
shows. Indeed, if we were to state an average peak to to- 
tal flux density conversion it would be a function of mass. 
An expl anation for th e discr epancy of our findings com- 
pared to IDunne etHI (2009) can be found in the use of 
higher resolved A-array data in our case compared to the 
B-array data constituting their radio continuum imaging. 
Hence, both results are correct given the respective data 
used and show that higher resolved radio data needs to 



be treated differently. The spread in conversion factors 
within our sub-samples is large and lower redshift objects 
show a significantly larger (-FTotai)/(-Fpcak) raticjf^ (see 
Fig. 3]). Moreover, further variations might arise depend- 
ing on the galaxy population studied. Hence, if high- 
resolution data is used, results are more robust when 
first total flux densities are individually derived for any 
radio stacking experiment before co mputing radio lumi - 
nosities. As it is apparent from the IDunne et al.l (2009) 
results their method should be considered, however, if 
stacking is used to infer the average radio luminosity of 
an entire galaxy population with a broad redshift distri- 
bution (Az > 1). As our broadest bins in redshift have 
Az = 0.5 - and this only at z ^> 1 where they span a 
much smaller range in time - it is indeed more accurate 
to rely on our approach given our radio imaging. 

In order to convert the derived average 1.4 GHz lumi- 
nosities into average SFRswe use the calibration of the 
radio-FIR correlation by Bell ( 2003) scaled to a Chabrier 
IMrQ 



(SFR) [M /yr] = 



3.18 x 10~ 22 L 

3.18xl0~ 22 L 
0.1+0.9 (L/L c )" a 



, L> L c 
,L<L C 



(6) 



where L = (L1.4 GHz) is the average radio luminosity 
derived from the median stack according to Eq. ([5]) 
and L c = 6.4 x 10 21 W/Hz i s the radio luminosity of 
an L»-like galaxy. As Bell (2003) empirically argues 
the low-luminosity population needs to be treated sep- 
arately from higher values of radio luminosities since 
non-thermal radio emission might be significantly sup- 
pressed in these galaxies. Even though our work exploits 
the radio- faint regime our derived average 1.4 GHz lumi- 
nosities lie generally above this threshold. Only at the 
lowest masses and z < 0.8 we find (L1.4 GHz) < L c (see 
Ta b. [2] and [31 . A ny study relying on the calibration 
bv lYun et al.1 ()2001[ ) i s, consequently, d irectly compara- 



ble to our results as lYun et all (|200lD used a uniform 
normalization very similar to the case L > L c in Eq. 
©0 According to iBelll (|2003l ) individual objects scat- 
ter about the average calibration by about a factor of 
two. It is not necessary to include this dispersion in the 
estimation of the final uncertainty on the SFR computed 
from the stack since the latter involves a sufficiently large 
number of sources to ensure that the average relation is 
representative. We do not attempt to take the differences 
of the derived SFRs caused by the discrepancy of the 
mentioned calibrations into consideration for the error 
estimates of our results. We also neglect any uncertainty 
on the median photo-z so that all errors on the derived 
SFRs result from propagation of the errors derived using 

40 Note that a larger conversion factor is equivalent to a larger 
source extent. Since it is unlikely that the varying number counts 
in our sub-samples are responsible for mass- or redshift-dependent 
source sizes we infer that higher mass objects are intrinsically more 
extended at all redshifts compared to their lower mass siblings. 
The larger correction factors at lower z can be explained by the 
increasing angular diameter distance towards higher z. 

41 IBelll 1120031 ) adopts a Salpeter initial mass function with IMF ex 
M -2 ' 35 in the mass range from 0.1 to 100 Mq so that we divide 
his normalization by 1.74. 

42 A ra dio l uminosity independent cali bration has al s o bee n pre- 
sented bv ICondor] 11992D . We refer to IDunne et al.l d2009|) who 
presen t all their results using both the [Bell (2003) and I Condon! 
(1992) calibration. 



Table 2 

Radio stacking results for the entire mass- selected sample 
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Note. — Median stacking-bascd average 1.4 GHz radio flux densities and derived average quantities for all our bins in mass and rcdshift for the entire 
mass-selected sample . A Chabrier 12003) IMF is assumed. Radio luminosities arc stated in units of L c , the threshold luminosity below which [Bell 120031 ') 
empirically found the non-thermal radio emission to be suppressed (see Eq. |6( ) . Resulting SFR.s from bins with lower radio luminosity are hence boosted 
compared to e.g. the calibration of the radio-IR. relation by Yun et al. I 2001). The median stellar mass and median z for any given bin are also stated. 

Mass bin contains data below the limit of mass representativeness and yields an upper limit to the average SFR (see Sec. |2.6| for further details.) 
* First mass bin above the limit of representativeness (see Sec. |2. 61 which contains a low fraction (< 15%) of optically faint objects with rnj^^{i~^~) > 25.5 
for which the photo-z accuracy is degraded (see Sec. |2 . 2| for further details). 
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Radio stacking results for star forming systems 
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'.ss- selected sample . For details see 
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■ f Tab. [2] 



rage quantities for all < 



; and rcdshift for star forming 



Mass bin contains data below the limit of mass represeiitati veneris and yields an upper limit to the average SFR (see Sec. f2T6l for further details.) 
First mass bin above the limit of representativeness (see Sec. |2.6( which contains a low fraction (< 15 %) of optically faint objects with ^^^(^ ) ^ 25.5 
for which the photo-z accuracy is degraded (sec Sec. |2 . 2| for further details). The average SFR measured in this bin might be slightly overestimated towards 
higher values (sec Sec. f2~6| . 
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equation 

Finally, for a given sample, specific SFRs are computed 
as the ratio of the SFR and the median stellar mass. 
Based on the same arguments as before we neither take 
into account an uncertainty in the median mass for the 
error estimates of our derived SSFRs. As we exclusively 
deal with average quantities in this work we omit the 
()-notation in the following. 

4. THE SPECIFIC SFR (SSFR) OF MASS-SELECTED 
GALAXIES OVER COSMIC TIME FROM RADIO 
STACKING 

In the remaining parts of this paper we present our 
measurements of the SSFR-M* relation (this Section) 
and discuss their implications for the evolution of the 
cosmic SFR density (Sec. [3]). 

4.1. The relation between SSFR and stellar mass 

We first consider the whole sample including all galax- 
ies and show the redshift dependent radio-based SSFRs 
that are distributed in the logarithmic SSFR-Af, plane 
as seen in the left panel of Fig. [SJ It is clear that the 
SSFR for a given stellar mass increases with redshift and 
that it generally decreases with increasing stellar mass. 

The data at the high-mass end (above log(M*) « 10.5) 
within all considered redshift slices suggest power-law re- 
lations between SSFR and stellar mass of the form 

SSFR(M„,z) = c(z) x M/ (z) . (7) 

In the following we will refer to the index (3 also as slope 
since the relation is commonly shown in log-space. The 
dashed lines in Fig. [Sjdepict the best fit to the data in the 
mass-representative regime (see Sec. I2.6|) and indicate 
that only the normalization evolves while the power-law 
index /?all of the individually fitted relations shows mi- 
nor fluctuations but no clear evolutionary trend. Only at 
z > 1.5 there is tentative evidence for a somewhat shal- 
lower slope. However, at the highest redshifts probed too 
few mass-representative data points exist to perform the 
linear fit. Our evidence is solely supported by the offset 
between the SSFR of the most massive galaxies and those 
of intermediate mass remaining the same as at z ~ 1.8. 
Based on our data it therefore is justified to consider the 
index /3all in Eq. (0 a constant at least for all z < 1.5 
and log(M*) > 10.5 . 

At log(M*) < 10 — 10.5 we see at practically all epochs 
that the measured SSFRs significantly deviate from the 
relation fitted to the high-mass end. The extrapolation 
towards lower masses over-predicts the measurement. In 
Sec. I2.6l we argued that all these data points - lying be- 
low the mass representativeness limits - likely represent 
upper limits. We hence believe this is a genuine devia- 
tion that is reminiscent of the bimodality (whereby quies- 
cent galaxies preferentially populate the high-mass end) 
in the SSFR-M* plane confirmed at various redshifts for 
galaxy samples with individually measure d SFRs (e.g. 
Brinchmann et all [20041: i Salim et all 120071: lElbaz et al.l 
2007HSantini et al.ll200a iRodighiero et al.lboiOal) . 

Using our spectral classification scheme we separately 
study the SF galaxy population in order to break the 
afore mentioned bimodality. The right panel in Fig. [5] 

43 This is justified as this error scales with the number of objects 
l/x/A 7 where N > 10 2 given our binning scheme. 



shows that a power-law relation according to Eq. ([7]) 
holds over the entire mass range probed, once quies- 
cent galaxies are excluded. Linear fits exclusively to the 
mass-representative regime show that, at z < 1.5: (i) 
SSFR declines towards higher mass, and that (ii) the 
slope Psfg is constant, as it was the case for the en- 
tire galaxy population. Compared to the entire sample, 
the slope is significantly shallower^ All theses conclu- 
sions also hold at all other epochs probed but are sup- 
ported by fewer data points significantly above the mass- 
representativeness limits that enter the fits. Hence we 
regard our conclusions as most robust at z < 1.5. 

Above z ~ 1.4 and below log(Af») « 9.5 — 10, we again 
find that measurements in the regime not regarded as 
mass-representative lie significantly below the linear fits. 
Since quiescent galaxies are even less frequent at these 
redshiftfQ the bimodality argument is obviously insuf- 
ficient to explain this observed trend. A possible expla- 
nation is that the magnitude limit of our catalog leads 
to a loss of dust-dominated systems with low masses but 
high star formation activity. If this were the case our 
previous statement that SSFRs in the under-represented 
mass-regime are upper limits would not necessarily hold. 
However, we do not expect a sufficiently high number 
density of low-mass dusty starbursts to make this sce- 
nario plausible. Another explanation could lie in the dy- 
namical considerations presented in Sec. 14.21 

4.2. A potential upper limit to the average SSFR of 
normal galaxies 

The fact that the aforementioned deviations from the 
linear fits at low masses steadily grow with redshift hints 
at a solid upper limit to the average SSFR. Local spiral 
galaxies have on average a dynamical timescale - i.e. the 
rotation timescale at th e outer radius o f a disk galaxy 
- of Tri vn ~ 0.37 Gyr (jKennicuttj [1998D . iDaddi et al] 
(|2010bf) show that this still holds at z ~ 1.5. The in- 
verse of this dynamical timescale, l/rd yn ~ 2.7 Gyr -1 , 
is similar to the threshold that seems to prevent our 
average SSFRs from rising continuously with decreas- 
ing mass. Note also that this dyna mical timescale ap- 
proximately equals the free-fall time (Gcnzel et al.ll201fj| ) 
which is commonly used to r elate SFR volume density 
with gas volume density (e.g. iSchmidtl U9 59: Ke nnicuttl 
19981: IKrumholz fc McKeel [2005: Kr umholz et al.1 120091: 
Lerov et al.ll2008f ). 

As indicated in Fig. [5] the population of z > 1.5 
hence galaxies reaches average levels of star formation 
that enable these normal SF systems to double their 
mass within a dynamical time scale. Generally, star for- 
mation is thought to be limited by the rate at which 
cold gas is accreted onto the galaxy (e.g. iDutton et al] 
20101 iBouche et al l 120101 and also e.g. lKeres et al.ll2005l : 
Maccio et al.1 1200*61 where simulations actually show the 
cold gas inflow) while the efficiency of star formation does 

44 At high masses, the radio-derived SSFRs for SF galaxies lie 
significantly above those for all galaxies demonstrating that the 
SFD-based pre-selection is efficient. 

45 Also at high z there is evidence for the e xistence of quiescent 
systems that are predominantly massive (e.g. [U imatti ct all 120041 ; 
IKriek et all 2006. 2008; Brammcr ct al. 2003). However, as our 
spectral classification of SF systems is efficient to exclude passive 
galaxies (see 110) and as these systems are also rare we do not 
expect them to cause the observed trend. 
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Figure 5. Radio stacking based measurement of the SSFR as a function of stellar mass at 0.2 < z < 3.0 for our entire galaxy sample 
(left) and SF systems (right). Open symbols depict samples containing galaxies less massive than the individual limits denoted in Fig. 
[l] and are regarded as not representative for the underlying galaxy population and rather represent upper SSFR limits. Dashed lines 
are two-parameter fits of the form c X to the mass-representative data depicted by fille d symbols (see T ab. [JJ. The horizontal red 
band sketches the inverse dynamical time of (370 ± 50) Myr measured in local disk galaxies (Kennicutt 1998) and also found in massive 
disk galaxies at 2 ~ 1.5 (Daddi ct al. 2010b). Galaxies with such high levels of SSFR effectively double their mass within a dynamical 
time. As detailed in Sec. 14.11 this might represent an upper bound to the average SSFR. All measured data points are listed in Tab. 
[2] and [3] The derivation of error bars involves a b ootstrap ping analysis combined with the uncertainties to the best-fit model of each 
stacking-dcrived average radio source (see App. IB. 21 and Sec. 13.21 for further details). We do not account for uncertainties associated with 
the SFR-calibration, the photometric redshift and stellar mass estimates as the large number of objects stacked for each data point ensures 
that even the joint error budget is statistically reduced to a low level that would not substantially enhance our uncertainty ranges. 



Table 4 

Two parameter fits to the mass dependence of the SSFR 



Az 


All galaxies 
log(c ALL [1/Gyr]) 


/3all 


X 2 /d.o.f 


SF systems 
l°g(csFG [1/Gyr]) 


/^SFG 


X 2 /d.o.{ 


0.2-0.4 


-1.63 ±0.04 


-0.73 ±0.03 


0.11 


-1.27 ± 0.03 


— 0.44 ± 0.03 


0.03 


0.4-0.6 


-1.22 ±0.04 


-0.75 ± 0.04 


0.24 


-0.90 ± 0.03 


— 0.42 ± 0.03 


0.78 


0.6-0.8 


-0.96 ±0.08 


-0.57 ± 0.08 


0.17 


-0.67 ± 0.03 


-0.40 ± 0.03 


1.37 


0.8-1.0 


-0.81 ± 0.07 


-0.73 ± 0.06 


0.07 


-0.48 ± 0.03 


-0.38 ±0.03 


1.42 


1.0-1.2 


-0.53 ± 0.05 


-0.58 ± 0.05 


0.64 


-0.38 ±0.03 


-0.46 ±0.03 


0.61 


1.2-1.6 


-0.33 ±0.13 


-0.61 ± 0.12 


0.12 


-0.12 ±0.03 


-0.30 ±0.03 


1.08 


1.6-2.0 


0.04 ± 0.08 


-0.33 ±0.07 


1.47 


0.10 ± 0.07 


-0.41 ± 0.07 


2.21 


2.0-2.5 








0.22 ± 0.06 


-0.42 ± 0.05 


0.19 


2.5-3.0 








0.43 ± 0.16 


-0.44 ±0.15 


0.81 




(Pall) = 


-0.67 ± 0.02 


+U.34 
-0.08 


(/3SFG> = 


-0.40 ±0.01 


+0.10 

-0.06 



Note. — A power-law fit of the form c X (M*/10 n Mq)P (Eq. Q) was applied to the radio 
stacking-based SSFRs as a function of mass within any redshift slice. Fits have only been applied if 
more than two data points remained above the mass limit where the individual sample is regarded 
mass-representative. The resul ts fo r all galaxies are shown in the left half of the table while those for 
star forming systems (see Sec. 12.41 1 are given in the right half. The weighted average power-law index 
(over all accessible redshifts) found for each population is stated at the bottom along with the formal 
standard error and the scatter range yielding a more realistic uncertainty estimate. 



not appear to change out to the highest redshifts acces- 
sible to molecular gas stud i es in normal disk g alaxies 
to-date (|Daddi et al.l l2010aL iTacconi et al.l 120101 ). Con- 
sequently, even the highly elevated gas fractions - i.e. the 
amount of gas available for star formation over the sum 
of g as and stellar mass — compared to local disk systems 
(e.g. lDaddi et al.ll2010al who find up to 60 % at z = 1.5) 
might not suffice to sustain a star formation activity that 
proceeds faster than gravity permits. As average galaxies 
reach inverse SSFRs comparable to their inverse dynam- 
ical - and, most importantly, free fall - time it is hence 
likely that an effective gas accretion threshold is reached. 
Hence the SSFR should stop its growth with redshift at 



some point. Lower mass galaxies reach this threshold at 
lower redshifts than the more massive systems leading 
to the flattening of the relation we observe at the lower 
mass end. We will henceforth refer to the transition from 
an inclined to a flat SSFR-sequence as 'crossing mass'. 

It is clear that carbon monoxide ALMA-studies at 
z > 1.5 of typical SF systems with M* < 10 10 M Q are re- 
quired to understand their molecular gas properties and 
to test the star formation law of this population. 

4.3. The redshift evolution of SSFRs as a function of 

mass 

The redshift evolution of our data is shown in Fig. [6] 
for all galaxies and for the SF population. Both pan- 
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Figure 6. Redshift evolution of the SSFRs for all galaxies (left) and SF systems (right) in logarithmic stellar mass bins. Two-parameter 
fits of the form C X (1 — z) n are applied only to data points derived from samples regarded as representative for the given underlying 
galaxy population which are depicted by filled symbols (see Tab. [5}. The black long-dashed line gives the mass-doubling limit above which 
galaxies are able to double their mass until 2 = assuming a constant SFR and it therefore equals the inverse lookback-time. The black 
dashed-dotted line depicts the inverse age of the universe at any g iven redshift and hen ce makes measured SSFRs comparable to the past 
average star formation activity. The SED-derived measurement of Magdis et al. (2010) for LBGs at z ~ 3 with log(Af») ~ 10 is shown as 
a filled star. All data results are listed in Tab. [2] and [3] 



els suggest a co-evolution of the considered mass-bins 
at least out to z ~ 1.5; while all measured SSFRs in- 
crease with redshift, the high-mass end does not evolve 
faster compared to lower masses and it always has the 
lowest SSFRs. An offset between the typical SSFRs of 
different mass bins is also evident for SF galaxies but 
it is smaller than for the entire galaxy population which 
shows a wider spread of SSFRs at fixed mass. Clearly, all 
these aspects are the direct result of our previous find- 
ings: 

• A constant slope j3 of the SSFR-M* relation is ob- 
served for all galaxies (at the high-mass end) as well 
as for SF galaxies alone at least out to z ~ 1.5. 

• The slope /3sfg is shallower for SF systems. 

In Fig. [6] we also plot the mass doubling lineal (long 
dashed line) and the inverse age of the universe at any 
given redshift (dashed-dotted line). Our measurement 
clearly shows that virtually all SF galaxies display a 
higher star formation activity than is required if their 
entire mass had been build up at a constant rate over 
the whole age of the universecZI All galaxies, generally, 
cross the dashed-dotted line sooner or later depending 
on their stellar mass. The most massive systems enter 
the stage of sub-average star formation activity already 
at z ~ 0.8. 

At the high-mass end (log(M») > 11) the SSFR for SF 
galaxies increases by almost a factor of 50 and is about 
twice as much for all galaxies within 0.2 < z < 3. For a 

46 At a given redshift, the mass-doubling threshold is given by 
the inverse lookback time. A SSFR in excess of this limit is hence a 
mass-independent indicator for the potential of a galaxy to double 
its mass by z = if it were to maintain its current SFR. 

47 This does not necessarily imply that an individual galaxy 
maintains a high level of star formation activity. All statements 
we make here refer to an average galaxy of a given mass and cos- 
mic epoch having a well-defined SSFR thanks to the SSFR-Af* 
relation. Star formation might still be subsequently quenched in 
an individual system so that its evolutionary track does not need 
to coincide with those shown in Fig. \E\ 



given mass-bin the redshift evolution is well described by 
a power law g(z) cx (1 + z) n as depicted by the dashed 
lines in Fig. [61^1 For the most massive SF galaxies this 
relation holds out to the highest redshifts probed, thus 
no flattening is observed. However, towards lower masses 
and z > 2 significant deviation of the data from the best- 
fit relation with lower SSFR towards lower masses is ap- 
parent. Again the argument of an upper SSFR limit 
due to dynamical reasons might explain such a devia- 
tion. For refere nce we also show th e recent SED-based 
measurement bv lMagdis et alj (|2010f ) for a Lyman Break 
Galaxy (LBG) sample at z « 3. Their study probes 
M* w 10 10 Mq and we see that their SSFR-measurement 
is significantly below the extrapolation given by our evo- 
lutionary fit in the same mass-regime even for SF galax- 
ies. Basically, their data point is extending our mea- 
sured data at the low-mass end if evolution were to stop 
at about z ss 1.5 (a scenario suggested b y the data of 
iStark et al.ll2009l and [Gonzalez et al.ll2010f ). 

Summarizing our findings a separable function of the 
form 

SSFR(M„,z) ocf(M*) x g(z) = Jvlf x (1 + z) n (8) 

describes well the mass-dependent evolution of the SSFR 
given our data within the restrictions discussed. For SF 
galaxies we find /3sfg ~ —0.4 and tt-sfg ~ 3.5. We em- 
phasize again that the dynamical arguments discussed in 
Sec. !4.2l would give rise to a value of /?sfg = below the 
crossing mass of the average SSFR and the upper limit- 
ing SSFR. The results of the individual fits to our data 
yielding the parameters /3 and n for all and SF galaxies 
are presented in Tables [4] and [5] 

It is worth noting that at z > 1 - where angular diame- 
ter distance is approximately constant - the evolutionary 
trend we find is very close to the redshift dependence of 
the radio luminosity of about (1 + z) 3 8 (see Sec. 13. 2|) . 
As the SSFR is proportional to the radio luminosity this 

48 We fitted only data in the representative mass range. 
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Table 5 

Two parameter fits to the redshift evolution of the SSFR 



Alog(M» [Mq]) 


All galaxies 
log(C AL L [1/Gyr]) 


™ALL 


X 2 /d.o.f 


Alog(M» [M ]) 


SF systems 
log(C S PG [1/Gyr]) 


"SFG 


X 2 /d-o.f 










9.4-9.8 


-0.92 ±0.45 


3.02 ±0.15 


0.65 


10.2-10.5 


-1.53 ± 0.56 


4.18 ±0.05 


3.05 


9.8-10.2 


-0.92 ±0.33 


3.42 ±0.07 


1.63 


10.5-10.8 


-1.76 ± 0.93 


4.28 ± 0.05 


1.48 


10.2-10.6 


-1.11 ±0.28 


3.62 ±0.04 


5.24 


10.8-11.1 


-1.92 ± 1.48 


4.27 ± 0.05 


1.75 


10.6-11.0 


-1.28 ±0.47 


3.48 ± 0.04 


1.73 


> 11.1 


-2.12 ± 3.93 


4.53 ±0.07 


2.37 


> 11.0 


-1.41 ± 0.82 


3.40 ±0.06 


1.52 




<«ALL> = 


4.29 ±0.03 


+U.'2i 
-0.11 




<«SFG> = 


3.50 ±0.02 


-0.48 



Note. — A power-law fit of the form C X (1 + z) n was applied to the radio stacking-based SSFRs as a function of redshift within any mass bin. Fits have 
only been performed if more than two data points remained above the mass limit where the individual sample is regarded mass-rcprcscntativc. The results for 
all galaxies are shown in the left half of the table while those for star forming systems (see Sec. \T~T\ are given in the right half. The weighted average power-law 
index (over all accessible masses) found for each population is stated at the bottom along with the formal standard error and the scatter range yielding a more 
realistic uncertainty estimate. 



is yet another argument to support that our inferences 
are not challenged by systematic errors to the median 
redshift even in our broader bins at z > 1 (see also the 
corresponding discussion in Sec. 13. 2[) . 

In order to alternatively probe our inferences in the 
redshift range below z — 1.5 where our data yields 
the most robust results we also stacked the same bins 
in redshift and mass into the Spitzer 24 and 70 /im 
COSMOS maps. We inferred SFRs from the total (8- 
1000 (j.m ) IR luminosity pred icted by the best-fitting 
IR SED ((Chary fe Elbazj|200l given the joint flux den- 
sity information. The results do not deviate significantly 
from those derived from the radio emission so that all our 
conclusions remain robust also when derived from the IR 
data. All these and further results will be presented and 
discussed in detail in a separate publication (Sargent et 
al., in prep.). 

4.4. Comparison to other studies 

In this section we compare our findings with results 
in the literature, with a particular focus on those least 
dependent on extinction corrections because they use ei- 
ther radio stacking or stacking of IR imaging by Spitzer 
and, most recently, Hcrschcl. Literature data we show 
in the Figures belonging to this Section are based on a 
Salpeter IMF and have been converted to the Chabrier 
scale. 

The evolutionary power law we derived for all mass- 
selected galaxies i s in excellent agreeme nt with the re- 
sults presented by Damen ct al. (2009a), both in terms 
of the evolutionary exponent and the normalization of 
the trend. We hence concur, in particular, with those 
findings of iDamen et al.l ()2009al ) resulting from a de- 
tailed comparison of their re sults with prediction s from 
the semi -analytical model of IGuo fc White! (|2008ft . The 
study of IDamen et al.l (|2009al) . which is based on SFRs 
from 24 /im and UV detections in conjunction with deep 
Jsf-band observation in the Chandra Deep Field South, 
is also in broad agreement w ith the 24 /im stacking anal- 
ysis of iZheng et al.l (|2007al) at z < 1 in the same field. 
Consistent findings have also been presented recently in 
th e Spitzer /MIPS sta cking analysis at 70 and 160 /im 
bv lOliver et al.1 (|2010T > whose data covers the largest on- 
sky area of all aforementioned surveys, albeit at a re- 
duced depth of F3.6 fj, m — 10 /iJy (an order of magnitude 
shallower compared to our sample) which prevents them 
from reliably constraining the evolution beyond z ~ 1. 
Based on a deep rest-frame NIR bolometric flux den- 



sity selected galaxy sample in the north ern Great Obser- 
vatories Or igins Deep Survey (GOO DS IGiavalisco et al.l 
120041) field ICowie fc Bargerl (|2008l) measure extinction 
corrected UV-based SSFRs for all individual objects out 
to z — 1.5. Their average trends with mass agree well 
with our results for all galaxies in the comparable red- 
shift ranges and also on absolute scales both studies are 
consistent at all masses. 

Radio-based measure ments of the S SFR-M * relation 
have been presented by iDunne et al.l (|2009f) . In terms 
of the evolution of the SSFR-sequence both their and 
our study show a good agreement. The findings by 
IDunne et all ([2009) differ from ours (see Fig. [7]) as well as 
from most other studies not restricted to SF galaxies only 
in that they report an almost non-existent slope /?all at 
all reliably probed epochs. Their analysis and ours share 
some methodological similarities (e.g. the use of a mass- 
(in their case: K-) selected sampl^H and a radio stacking 
approach) and should therefore be directly comparable. 
Despite the technical differences in the exact implemen- 
tation of the image stacking as already discussed (see Sec. 
13.21) it seems unlikely that an explanation for the differ- 
ent trends can be found in the radio data used. It ap- 
pears more likely that the deriva tion of individual st ellar 
masses causes the differences as IDunne et al.l (|2009T ) use 
a direct conversion from the rest-frame K-band magni- 
tudes as measured from the best-fitting SED templates to 
stellar mass which exclusively depends on redshift. Such 
a conversion s hould not only be diff erent for SF and qui- 
escent sources (|Arnouts et al.|[2007l) but even ceases to be 
applicable towards lower mass SF sources as discussed in 
App. D of 110. It is hence likely that low-mass SF sources 
with higher SSFRs have migrated to higher masses pro- 
ducing artificially elevated SSFRs at the high-mass end. 
This explanation is consistent with the generally higher 
deviations from our r esults at higher mass es (see Fig. [7]) . 
Since neither we nor IDunne et all ([2009) find a signifi- 
cant evolution of the slope /3all in the SSFR-M* plane 
the pure evolutionary behavior reported in both studies 
is largely consistent. 

The SSFR-M* relation for sBzK - and hence SF K- 
band sele cted - galaxies i n the COSMOS field was de- 
rived by IPannella et al.l (|2009l ) based on radio stacks 
from the same VLA image. Our results for SF galax- 
ies are (necessarily) in good agreement with their find- 

49 It is, however, worth noting that our number statistics are 
larger by about a factor of four. 
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Figure 7. Left Panel: Compa rison of our results (dashed and solid lines of different c olor) of the mass dependence of t he SSFR for SF 
systems at different rcdshifts to Dunne et al. (2009); grey diamonds, Elbaz ct al. (2007); red circles at z ~ 1, as well as Pannella et al. 
(2009); blue squares at z ~ 2. The mass limits above whic h our sample is rep resentative are denoted as black dashed lin e s in th e upper 
left. Right Panel: The corresponding measurements by Oliver ct al. (2010); grey shaded bands and Rodighicro et al. (2010a); black 
diamonds at various r edshifts along with our results (color bands with mean redshift scale at the right hand side). The Herschel/PACS 
based SSFR-scquencc (Rodighicro ct al. 2010a) sugges ts a mild 'upsizing' trend and appears to stop its evolution at z > 1.5 where the only 
(apparent) deviation from our data occurs. See Sec. 14.41 and Appendix \U\ for a discussion of effects introduced by selection biases. For 
immediate comparison our data are shown as open symbols in both panels, rebinned in z in order to cover th e sa me range in redshift as 
each referenced study. The inverse horizontal red band sketches the inverse dynamical time as detailed in Sec. 14.11 



ings at z w 2 (left panel of Fig. [7J where they do not 
probe the hi ghest mass-range pres ented here. A main 



conclusion of iPannella et al.r(|2009f) is, however, a mass 



independent SSFR at z > 1.5 which is mainly inferred 
from a measurement on their entire SF sample (not fur- 
ther divided by redshift) and a measurement at z w 1.6 
both covering the same mass-range as considered here. 
A similar t endency at z ~ 2 has previously also been 
reported bv lDaddi et~aTI (|2007| ) in a study carried out in 
the GOODS fields. Their work is based on 24 /xm de- 
tected galaxies down to log(M*) ~ 9.5 but also based on 
radio stacks of their K-band selected sample. As galaxies 
at 1.3 < z < 1.5 substantially co ntribute to the photo - 
metric redshift distribution of the IPannella et ahl (2009) 
sample, it is likely that the sBzK criterion no longer se- 
lects all SF objects at these low redshifts. In this context 
we also refer to Appendix [C] where the upper left panel 
in Fig. [14] shows that the sBzK criterion by construc- 
tion fails to select all SF sources at z < 1.5. As we 
already pointed out, our SSFR-M* relation for our SF 
sample tends to flatten towards lower M* . When consid- 
ering only low to intermediate masses, all measurements 
based on stacking into the VLA-COSMOS 1.4 GHz map 
are thus in good agreement. The steeper slope Psfg of 
the SSFR-M* relation for SF galaxies found in this study 
is thus a consequence of the fact that we span a larger 
mass range at z ~ 2. 

The left pa nel of Fig. [7 ] also sh ows the results at z » 1 
presented by lElbaz et al.l (|2007j ) based on 24 fim detec- 
tion resulting from deep Spitzcr/MIPS observations of 
the GOODS fields and UV-corrected SFRs. Although 
this study too infers a nearly constant relation between 
SSFR and mass, the figure shows that the radio-derived 
results agree with the mid-IR measurements remarkably 
well. This illustrates that measurements of the slope 
Psfg °f the SSFR-M* for SF galaxies are quite sensitive 
to deviations at the edges of the mass range even if mea- 
surements at individual masses do not significantly differ 



between different studies. Finally, it is also worth not- 
ing that, towards lower redshifts, our slope /3sf g agrees 
well with the measurements by iNoeske et al.l (|2007bl ) 
which are based on SFRs from emission lines, UV as 
well as 24 /xm imaging for a K-band selected sample 
of the DEEP2 spectroscopic survey. Also in the local 
universe GALEX/UV-based values around /3 = —0.35, 
consistent with our study, have been reported for galax- 
ies taken from the Sloan Digital Sky Sur vey (SDSS) 
(jSalim et alJl2007t ISchiminovich et alJl2007t ). It should 
be menti oned that, based on the S PSS emission lines 
study of iBrinchmann et alJ ([20041 ) . lElbaz et al.l ([20071 ) 
found a slightly shallower slope /?sfg = —0.23 for SF 
galaxies in the local universe. 

Image stacking results using Hers chel/PACS data 
at 160 /xm have been presented by iRodighiero et all 
(|2010aD . Their GOODS-North Spitzer/IRAC data is 
only slightly shallower compared to our COSMOS imag- 
ing. As the ri ght panel of Fig. [ 7 1 shows , individual mea- 
surements bvFRodighic ro et aLl(|2010al ) at z < 1 are in 
good agreement with our findings. (The one exception 
being their lowest redshift which extends to z = 0, ex- 
pla ining the overall slightly lower SSFRs.) At z > 1.5 
the IRodighiero et all (|2010a| ) results suggest that SSFRs 
cease to grow further at the high-mass end. While in 
this redshift range our radio-derived SSFRs agree with 
the far-IR based ones at the lowest masses probed, the 
radio measurement yields about 0.4 dex (i.e. signifi- 
cantly) higher SSFRs at the high-mass end as they do 
not show a different redshift trend than at lower z. As 
our highest redshift bin is centered at a slightly higher z 
compar ed to the corresponding one of [Rodighicr o et al.l 
(|2010a|) the difference might be slightly lower if the bins 
were perfectly matched and given the high-mass SSFR- 
evolution for SF galaxies is continuing at z > 1.5. We 
therefore see no clear evidence for strong discrepancies 
of radio- and far- IR stacking deriv e d SSFR s at high z 
as speculated by IRodighie ro et al.l (|2010af) when com- 
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paring their results t o those of iPannella et all (|2009l ) 
and especially those o fDunne et al.l ( 20091 ) . We empha- 
size, however, that future far-IR studies could test po- 
tential mass-dependent changes in the radio-IR corre- 
lation at z > 1.5 responsible for the slight differences 
reported herePI Based on power-law fits to their data 
iRodighiero et al.l (|2010aD infer a steepening of /?sfg to- 
wards higher z, an effect they consequently term 'up- 
sizing'. Note, howe yer, that our measureme nts of /3sfg 
agree with those of IRodighiero et all (|2010al ) within the 
uncertaintie s. Tentative evide nce for upsizing is also 
reported bv lOliver etaLl (|2010f ) who use Spitzer/MIPS 
stacking of late-type galaxies at 70 and 160 fj,m (see right 
panel of Fig. [7]). In Appendix ICl we show that we can 
mimic an upsizing trend, as well as the somewhat flatter 
evolution of the SSFR o ut to z 2 reported by, e.g., 
Rodighicro eTall (|2010aD if we restrict our SF sample to 
those sources with the most active star formation. 

5. THE RADIO-BASED COSMIC STAR FORMATION 
HISTORY 

Based on our measurement of radio-derived SFRs as a 
function of mass we directly derive accurate SFR densi- 
ties (SFRDs hereafter) for SF galaxies above the limiting 
mass at z < 1.5 and further constrain the CSFH out to 
z = 3. We will also introduce two alternative extrapola- 
tions to low-mass objects that we do not directly observe. 
Because of our generally low mass limit, the impact of 
the extrapolation - especially out to z = 1.5 — to these 
faint galaxies is small compared to most other studies. 

5.1. The mass distribution function of the SFRD at 
fixed redshift 

At a given redshift and mass, the SFRD is computed 
as the product of (i) the comoving number density as 
inferred directly from the number of galaxies in the rele- 
vant mass bin and (ii) their average SFR as measured in 
our stacking analysis. 

As already pointed out, SFRs likely represent an up- 
per limit at the smallest masses where the sampling of 
the underlying population is not representative. Conse- 
quently, SFRDs at low masses are also upper bounds, as 
we can correct the number counts in a given low-mass 
bin for the lost objects. This is done by computing the 
expected number from the observed mass functions de- 
rived for the same sample of SF galaxies that is used for 
this study (see 110 for further details). We account for 
the slightly smaller portion of the COSMOS-ficld acces- 
sible to the radio-stack compared to the area used for the 
derivation of the mass-functions. The correction for the 
expected number counts is always small so that corrected 
and uncorrected values of SFRD(M„, z) agree within the 
errors. Since it is a systematic correction it still needs to 
be taken into account. 

All number-count corrected and uncorrected data 
points for the SFRD(M*,,z) are shown in Fig. [H 

50 Herschel/PA CS observations of the GOODS-North field 
IIElbaz et al. 2010) revealed that in the same redshift regime the 
total (8 — 1000 /an) IR luminosity appears to be overestimated 
when the IR template-SED fit is constrained by a single 24 /an 
measurement. The deviation starts at Ltr ~ 10 12 Lq and grows 
with increasing Ltr,, SFR and consequently mass as these quan- 
tities are correlated. It is therefore necessary to test the radio-IR 
correlation in the proposed way using far-IR data. 



There appears to be a characteristic mass of M* = 
10 io.6±o.4 Mq that con t r ibutes most to the total SFRD 
at a given redshift. Up to z ~ 1.8 our data points sample 
below this characteristic mass and the peak is well con- 
strained. At higher redshifts this is no longer the case. 

We want to motivate now that the underlying func- 
tional form for the distribution of data points in the 
SFRD-M* plane is actually known because of two facts: 

1. There is a (possibly broken) power- law relation be- 
tween (S)SFR and stellar mass for SF galaxies at 
all z < 3 as measured in this study. 

2. The functional form of the mass function for SF ob- 
jects in the same redshift range is well determined. 

Regarding the second poi nt, the mass fun c tion of SF 
galax i es is c o mmonly (e.g . iLilly et all Il 995: Bel l et all 
|200l [20071 iZucca et all 120061: lArnouts et all 120071: 
iPozzetti et al.1 120091 : lllbert et al.l 120051 12010ft found to 
be well parametrized by a power law with an exponen- 
tial cutoff at a c haracteristic mass M* as introduced by 
iSchechterl (|1976l ) of the form 

$ SFG (M„)dM, = $ SFG (M»/M S * FG ) QSFG (9) 
x exp (-M*/M S * FG ) d(M*/M* FG ). 

This Schechter function has recently been qualitatively 
as well as quantitatively been modeled to be the natural 
consequen ce of essentially tw o types of cessation of star 
formation (jPeng et al.| [2010)Fl 

Multiplying $sfg(M*) by the SFR-sequence, i.e. an- 
other power-law in mass, again produces a Schechter 
function. Hence we can write 

SFRD(M»,z)dM* = $ SFRD ($ SFRD ,5,M*)dM*, (10) 

i.e. a distribution (SFRD function hereafter) of the same 
functional form as Eq. (|9|) with the three parameters 
$g FRD , a and M*. While the exponential cutoff mass 
M* is the same as the one in the mass function (defined 
above as -Mg FG ), a = asFG + Psfg is the sum of the 
low-mass slope of the mass function of SF galaxies and 
the s lopeF^I of the SFR-sequence (see also iSantini et al.1 
2009, for a similar parameterization). The parameter 
$g FRD acts as a normalization and its role in the global 
evolutionary picture will be discussed in Sec. 15.21 

The index «sfg and also the cutoff mass M* 
for SF galaxies are constant in the r edshift regime 
consi d ered (e.e. iBell et al.l 120031 120071: lArnouts et all 
1 20071: iPerez-Gonzalez et al.l l2008t IPozzetti et all 120091: 
lllbert et al.ll2010l ). We assume here that «sfg and -^sfg 
stay constant also at z > 2. These assumptions are ten- 
tatively supported by the few observational constraints 



51 [Peng et al. 1)20101 ) refer to these two processes as 
'environment-' and 'mass-quenching'. The former one is likely to 
be explained by star formation being shut off in satellite systems as 
soon as galaxies fall into larger dark matter halos while the latter 
one is a continuos process stopping star formation within galaxies 
above the characteristic mass ALtp G at a rate proportional to their 
SFR. In the following we will make use of evolutionary constraints 
on the mass function parameters agpo, ^sfq an d ^SFG ^ or ^ 
galaxies in particular. Their trends are not only supported by re- 
cent literature (see the further discu ssion in this Secti on) but also 
naturally contained in the empirical IPeng et al.l l|2010P ) model. 

52 Please note that /9gpG denotes the slope of the SFR-M* re- 
lation for SF galaxies which is connected to the slope /3sfG of the 
SSFR-sequence (see Sec. 14. II and Tab. 2J| by /3sfg = Psfg + 1- 
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Figure 8. The distribution of the SFR density (SFRD) with respect to stellar mass, as measured at various epochs out to z ~ 3. In each 
panel the data points have been derived by multiplying the observed number densities of SF galaxies with the average (stacking-based) 
radio SFRs. Below the limit where our da ta is regarded mas s-representative - depicted by red dashed vertical lines - number densities have 
been corrected using the mass functions (Ilbcrt et al. 2010). The uncorrected data are shown for comparison as open circles suggesting 
that these corrections are generally small and no corrections are needed at z < 1.5. For 0.8 < z < 1.2, the [Oil] A3727-derived SFRDs 
(from[Gilbank et al. 2010b); open diamonds, resc aled by a constant facto r of two to match our data agree well with the trends in our data. 
The same holds true for the UV-based results by Cowie & Barger (2008) depicted at 1.2 < z < 1.6 for which no rescaling was necessary. 
Note that their data was derived over a broader range in redshift down to z = 0.9. As our data suggest globally only a mild evolution 
between 0.9 < z < 1.4 the comparison depicted is justified. In each panel we overplot the Schechter function that results from multiplying 
the best-fit radio derived SSFR-sequence at a given epoch with the corresponding mass function for SF galaxies. The uncertainty range is 
obtained by choosing the two sets of Schechter parameters within their error margins that maximize /min imize the integral. Dashed blue 
lines show the distri butio n obtained if an upper limit to the average SSFR at lower masses (see Sec. 14.21 for details) is assumed (referred 
to as 'case B' in Sec. 15.11 1. All literature data plotted here have been converted to our Chabrier IMF. 

of two other studies that rely on different SFR trac- 
ers. At z ~ 1 the dependence of the SFRD on stellar 
mass has recently been mea sured using the [OII1 A3727 
line to trace star formation (Gilban k et al.ll2~010b| ). We 
over-plot these data points in the corresponding redshift 
bins in Fig. |8] and find that our SFRD function ac- 
curately fits these measurements as wellEEl The same 
holds for the UV-derived results based on a Salpeter 

53 These data are based on a Baldry & Glazebrook IMF and 
have been converted to the Chabrier scale. An additional rescaling 
by a constant factor of two was necessary in order to match our 
calibration. This is in agreement with the results by Gilb ank et al.l 
l|2010bl ) that show SFRs based on practically all probed alternative 
tracers to be in excess of the [OII]-derived ones. They discuss 
possible explanations for this deviation. Given the well known 
global uncertainty in the absolute calibration of SFR tracers this 
deviation is, however, not significant. 



reported for these high redshifts as detailed in Sec. 
As detailed in Sec. 14.11 the power-law index /3sfg ~ that 
enters the parameter a = ctsFG + Psfg + 1 hi Eq. [TU] - 
is also found to be a constant. However, we explained in 
Sec. 14.21 that at masses lower than the crossing mass be- 
tween the SSFR-sequence and a possible SSFR-threshold 
/?sfg = should be assumed. In the following we will 
hence consider two possible scenarios below the suggested 
crossing mass at a given redshift: 



Case A 
Case B 



a = asFG + /?SFG 
a = «sfg + 1- 



1 



Fig. [S] shows that at z < 1.5 the parameterization of 
the SFRD function in Eq. [TU] can reproduce our data at 
all masses sampled and irrespective of the exact value of 
a. For this redshift range Fig. [8] also includes results 
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Table 6 

Schechter parameters for the stellar mass function of star 
forming galaxies 





a SFG 




SFG 

[10 -3 Mpc" 3 dcx" 1 ] 






[Mq] 


0.2-0.4 


-i no+0.01 

1 - oz -o.oi 


1 1 on+ 003 

ll.UU_ 03 


1 1 k+0.08 
1 - lo -0.07 


0.4-0.6 


i o 9 +0.01 


11 04+ ' 03 


70+ ' 05 
u ' ' u -0.04 


0.6-0.8 


i o 9 +0.01 

^•^-o.oi 


10.95±°$ 




0.8-1.0 


-1 16+ ' 01 
± - 1D -o.oi 




i OO+0.06 
J -°°— 0.06 


1.0-1.2 


_1 1Q+O-02 
± - la -0.02 


10 92+ 02 
iu.yz_ Q 02 


94+ - 05 


1.2-1.5 


i OO+0.02 
-l-2»_ 02 


10 91+ - 02 
iu.yi_ 02 


n fi«+ - 03 

U - b8 -0.03 


1.5-2.0 


-1 29+ - 02 


10 96+ 02 
±u.yo_ 02 


46+ 02 


2.0-2.5 


, 2 q+0.03 


in Qt;+0.03 
iu.yo_ 03 


n on+0.06 


2.5-3.0 


_j 2 q+0.03 


in 95+ 03 
iu.ya_ 03 


n 97 +0.05 



Note. — At z < 2 all param 
wc assume a non-evolving shape 
the average of the respective lo 
matching the number densities ti 
regime of our data. 



crived by 110. At z > 2 
d Mppp arc taken to be 



-*SFG ana "SFG 
lues. was 



rived by 



IMF by ICowie fc Bargerl (|2008f) in the GOODS-North 
field at 0.9 < z < 1.5. Given our results, the global 
evolution of the SFRD-function between 0.9 < z < 1.4 
is mild such that we can plot these data in Fig. [5] in 
the bin 1.2 < z < 1 .6. It is worth noting that the 
ICowie fc Bargerl (|2008[ ) measurements at z < 0.9 equally 
support our finding that the peak of the SFRD does not 
shift with redshift to higher values. 

Below the limiting stellar mass (dashed red lines in 
Fig. [5]) our data points are lower than the prediction 
of Eq. [TO] if we assume case A for a (even though we 
have applied a number density correction). Moreover, 
we remind the reader that - in keeping with our previ- 
ous discussion - these data points are likely upper limits. 
Given the comparatively large uncertainties of our SFRD 
functions at high z these deviations are not highly sig- 
nificant but the trend is systematic and suggest a steep- 
ening of the low-mass slope of the SFRD function. It 
is directly related to the fact that the corresponding 
data points deviate from the best-fit (S)SFR-M* rela- 
tion at lower mass. In Sec. 14.21 we proposed an upper 
limit to the average SSFR due to dynamical reasons as 
a possible explanation for the trends. Taking into ac- 
count this limit of SSFR = 1/Vdyn ~ 2.7 Gyr -1 yields 
an index /3sfg = below the mass at which our fitted 
high-mass SSFR-M* relation crosses the supposed SSFR 
limit at a given epoch. We plot the SFRD function for 
a = «sfg + 1 as dashed blue lines in Fig. [5] As the 
crossing mass increases with redshift and lies below the 
mass-representativeness threshold at z < 1 it has lit- 
tle impact on the mass-integrated SFRD. The reason is 
that the mass-dependent SFRD has declined already by 
at least an order of magnitude from the peak value before 
reaching the crossing mass. The changes are largest at 
z > 1.8 where the dashed SFRD function (case B) drops 
quickly towards lower masses right after the peak and, 
in doing so, traces our data points better than before 
for a — asFG + Psfg (case A). However, we emphasize 
that our data cannot clearly favor case A or B proposed 
given the large uncertainties of the Schechter parameters 
and the lack of representativeness of our data at such low 
masses at high z. Any scenario suggested, hence, awaits 
confirmation based on deeper data in the selection band 
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Figure 9. The Schechtcr-function description of the SFRD (re- 
ferred to as SFRD function) for z < 2.5, with color denoting the 
redshift range. This plot combines the information in the preced- 
ing Fig., now displayed with a linear scaling of the y-axis. The 
dashed vertical lin es of different colo rs show the Schechter param- 
eter M£ f q (from Ilbert ct al. 2 0101 ) for different redshifts. It is 
found to be nearly redshift-independent for SF galaxies. The rep- 
resentativeness limit of our SF sample is below A-fgpQ at all epochs 
as indicated for the highest redshift bin (2 < z < 2.5; dash-dotted 
vertical line). Thick dashed lines of different colors depict the mod- 
ified low-mass end trends for our SFRD functions (referred to as 
'case B') that result from assuming an upper limit to the average 
SSFR of SSFR ~ l/r dyn . Here, r dyn ~ 0.37 Gyr is the dynamical 
timescale for normal SF galaxies (see Sec. I4.2l for details). This plot 
clearly shows that the characteristic mass of SF galaxies - which 
always lies below Mg FG - does not evolve over time. This con- 
clusion holds regardless of the exact shape of the SSFR-scqucncc 
at lower masses. Therefore, our data exclude a scenario in which 
the peak of the SFRD function has shifted to lower masses at later 
epochs. 

once they are available. We robustly conclude that a 
single Schechter function is a good model for the SFRD- 
function over all masses at least out to z ~ 1 and that 
the distribution of SFRDs peaks in the same mass-range 
at all z probed in both case A and B. 

Fig. [9] shows the time evolution of our above con- 
structed SFRD function. This non-logarithmic plot 
clearly illustrates the existence of a characteristic mass of 
star formation at all epochs although it should be kept 
in mind that our results are most robust at z < 1.5. 
Our findings exclude an evolution of this characteristic 
mass towards lower values with cosmic time. At z < 1.5 
this important result is supported by independent obser- 
vations at different w avelengths (|Cowie fc Bargerl [20081: 
IGilbank et al.l l2010al | H) and, towards z « 2, also by 
empirical arguments (TPeng et all 120101 ) while the recent 
model of Boi ssier et al.1 (120101) predicts a mild evolution 
of this characteristic mass by about 0.3 dex. In Fig. 
[9] again dashed lines of corresponding color denote the 
low-mass trends of the SFRD functions once we assume 
the SSFR-limit discussed. As pointed out above, within 
the important mass-range above 10 10 M Q - at which the 
SFRD function peaks at any epoch - a mass- independent 
proportional increase is apparent even out to z > 2. In 
contrast to that, the mass-dependent SFRD appears to 
evolve mildly below 10 9 M Q . As a result galaxies in the 
stellar mass range between 10 10 — 10 11 M Q contribute 
most to the global - i.e. mass-integrated - SFRD at any 
epoch but low-mass systems gain more relative impor- 
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tance towards low redshifts. Evidently, low-mass systems 
show the same evolutionary trends as those of higher 
mass if case A is assumed. 

Earlier observational findings appear to be at odds 
with the existence of a characteristic stellar mass for star 
formation we measure. While they are all based on shal- 
lower survey data compared to our COSMOS imaging, 
the limiting magnitude of these samples is not necessar- 
ily the reason for the different conclusions. Uuneau et al.l 
(2005) - using a mass-independent extinction correction 
and hence a linear conversion from [Oil] luminosity to 
SFR - report that out to z ~ 2 the contribution to the 
increase of the global SFRD is more rapid, the more mas- 
sive the galaxies are. While this is a similar trend we find 
if we assume an upper limit in SSFR (case B), their low- 
est stellar mass-bin - centered at 10 96 Mq - always ap- 
pears to contribute most to the SFRD integrated over all 
masses. Hence, there is no clear evidence for a peak in the 
SFRD mass-distribution function and certainly not in the 
higher mass-range our data supports. H owever, based 
on the i r findings in the loca l universe (Gilban k et al.1 
I2010al) . iGilbank etldl (|2010bf ) empirically showed that 
a mass-dependent calibration between [Oil] luminosity 
and SFR is more appropriate. If true, this would modify 
th e low the low - mass dominance in the derived SFRDs 
of lJuneau et al.l (|2005l ) to correspond more closely to our 
result 

Independently, also the radio-stacking based study by 
iPannella et al.l fj2009f ) suggests a strong dependence of 
the dust-attenuation correction on stellar mass a t highe r 
(z > 1.5) redshift s. Similarly to Uuneau et al.l (|2005), 
iBauer et al.1 (|2005l ) derive their [OII]-based (S)SFRs from 
a mass-independent calibr ation albeit neglectin g extinc- 
tion corrections^ Finally, Bundy et al. (200(3) , who fa- 
vor a shift of the characteristic mass towards lower val- 
ues with cosmic time, entirely base their conclusion on 
the stellar mass functions they derive for SF galaxies (se- 
lected by rest-frame (U-B) color and [Oil] equivalent line 
width) within the DEEP2 survey sample. At z < 1.4 
these mass functions do not show an evolution of the 
faint-end slope, but the Schechter parameter M^ FG ap- 
pears to decrease with cosmic time, in contrast to the al- 
ready discussed broad agreement in the more recent liter- 
ature according to which neither osfg nor -^sfg change 
in this redshift range. 

To conclude, we summarize our findings as follows. 

1. Up to normalization, the mass distribution func- 
tion of the SFRD is a universal Schechter func- 
tion at z < 1 with possible deviations only below 

10 8 Mq. 

2. We explain this surprising constancy in shape of 
this SFRD-function by the non-evolving slope of 
the SFR-sequence and the constant shape of the 
mass function for SF galaxies. 

3. Our data at z < 1.5 clearly disfavors a strong 

54 Indeed, as Gilbank et al. ( 20102) point out, the mass- 
dependence here is not only introduced by dust extinction but 
results from an interplay of various additional factors, e.g. the 
mctallicity or the ionisation parameter. 

55 In the context of mass-dependent evolutionary effects not con- 
sidering any extinction correction is equivalent to considering a 
mass-uniform one. 



'downsizing' scenario in which the characteristic 
mass of SF galaxies that contribute most to the 
overall SFRD - integrated over all masses at a 
given time - shifts towards lower values over cosmic 
time. The situation does not appear to change even 
at z > 1.5 where, however, deeper data is needed 
to confirm our results. The characteristic mass is 
M* = 10 10 - 6±0 - 4 M Q . 

4. If we assume an upper limit to the average SSFR 
of the order of the inverse dynamical scale (rd yn ~ 
0.37 Gyr) the SFRD of galaxies less massive than 
10 9 Mq evolves less rapidly than the one of the 
dominant higher mass range above 10 10 Mq. 

5.2. The evolution of the SFRD 

As all introduced parameters show this remarkable 
constancy throughout the redshift range probed (at least 
out to z w 1) the redshift dependence of Eq. (fTTTj) is en- 
tirely contained in the normalization •I'gpj^rj. Eq. (|10p 
becomes a separable function so that we rewrite: 

SFRD(M*,z)dM* = $sfrdO) V M*) dM» , (11) 

where mass-dependence consequently is solely contained 
in the universal SFRD function ip. The global SFRD at a 
given redshift, integrated over all masses, is simply given 
by 

SFRD(z) = $ SFRD (z) J ^(a,M*)dM«. (12) 

In the following we will motivate that the evolution of 
the integrated SFRD follows a simple power-law of the 
form 

$!frd(*) [A/ Q /yr/Mpc 3 ] oc (1 + *) n *feo+ n *»«, (13) 

where the two power-law indices result from the change 
in stellar mass density contained in SF galaxies and the 
increase of the (S)SFR-sequence with redshift. 

As detaile d in IIP and previously also found by other 
studies (e.g. IBell et al.l 120031 [20071 lArnouts et al.l 120071: 
iPozzetti et al.ll2009[ ) the stellar mass density of SF galax- 
ies grows after the Big Bang only until z ~ 1. At lower 
redshifts it stays constant. Consequently, as the shape 
of the mass functions does not evolve, also the Schechter 
parameter 'fgpQ in the mass function is constant in this 
redshift regime, apart from fluctuations due to large scale 
density fluctuations. As shown in Fig. [TU1 these fluctua- 
tions are cons i stent w ith cosmic variance as estimated by 
iScoville et all (|2007b[) and detailed in 1100 It is clear 
that cosmic variance effects are strongest at low redshifts 
as the effective volume sampled in a redshift bin with 
Az = 0.2 increases with redshift. In the interest of sim- 
plicity and to avoid systematic errors caused by cosmic 
sampling variance we adopt a constant ^gpQ at z < 1. 

If we therefore set n<s>* FG (z < 1) = in Eq. (|T3| the 
evolution of the integrated SFRD in the range < z < 1 
is entirely described by the global, i.e. mass-uniform, de- 
cline of the average SFR of SF galaxies with cosmic time. 
It is important to emphasize once again that this strong 
decline is definitely not caused by a decreasing number of 
SF galaxies, in particular not at all at the high-mass end. 

56 For a detai led discussion o n cos mic variance in the COSMOS 
field we refer to lMeneux et al.l pOOl V 
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Figure 10. The redshift dependence of the Schechter parameter 
<3>g FG of the mass functions for SF galaxies as derived in the COS- 
MOS field (110). At z < 1 the normalization ^gpg behaves like 
the other Schechter parameters and stays constant except for fluc- 
tuations very likely cause d by cosmic variance. An estimate of the 
cosmic variance error (sec Scovillc et al. 2007b, for further details) 
is added in quadrature to the error of the Schechter fit at these red- 
shifts, as indicated by dotted error bars. At earlier epochs the mass 
build-up is reasonably well characterized by a power law in (1 + z). 
This trend is consequently also seen in the evolution of the stellar 
mass density of SF galaxies (110). The two highest redshift points 
(open symbols) were obtained under the assumption that the mass 
functions keep their shapes also at earlier cosmic times by match- 
ing the normalization to the numbers of galaxies observed in our 
mass-representative bins at the high-mass end. This assumption is 
supported by the direct observational constraint of from the 

best-fit S chechter-paramet e rizati on of the total mass- function mea- 
sured by Marchesini et al. (2009) which - at these high redshifts - 
should be representative for the SF population; red diamonds. 

As the efficiency of star formation wi thin SF systems ap- 
pears not to change over cosmic time ([Daddi et al.ll2010a ; 
iTacconi et all |2010), we conclude that below z ~ 1 
the strongly evolving integrated SFRD must be exclu- 
sively caused by a strongly declining mass density of 
cold molecular gas available for star formation towards 
the local universe. Recent theo r etical model predict ions 
(|Obreschkow fc Rawlingj I2009t IDutton et all 12010ft in- 
deed support such an evolutionary behavior of the cos- 
mic mass density of molecular hydrogen. As we already 
explained in Sec. 14.31 systematic redshift uncertainties 
will not influence our conclusions either as they would 
only be propagated along the redshift trend inferred. 

At earlier epochs (i.e. z > 1) we monitor the redshift 
dependence of the parameter $gpQ as measured by 110 
(Tab. H]). Fig. [TU1 shows that the above choice of a power 
law at z > 1 is a reasonable assumption and we find 
n#« (z > 1) = -2.38 ± 0.02 from a fit to our COSMOS 

V SFG y ' 

data points at 1 < z < 3. It should be mentioned that 
the two highest redshift points in Fig. [10] (open circles) 



were obtained by assuming that asFG arL d M£ PG stay 
constant also at z > 2 where 110 did not directly mea- 
sure the mass function. The normalization "J'sfg^ > 2) 
was therefore obtained by matching the number densi- 
ties to those observed in the mass-complete regime of 
our data when fixing the parameters asFG and M| FG 
to their average values at z < 2. This extrapolation is 
su pported by the total mas s function at z > 2 measured 
by iMarchesini et al.l ([2009D . At these high redshifts we 
assume the total mass function to be representative for 
the SF population as quiescent galaxies are not expected 
to signifi cantly contribute to the number density. The 
study by IMarchesini et al.l (|2009ft was carried out based 
on data taken in various survey fields for which deep NIR 
imaging is available allowing them to estimate Schechter 
parameters for the total mass function also in two high-z 
bins in the ranges 2 < z < 3 and 3 < z < 4. Within 
the errors all our extrapolated Schechter parameters at 
2 < z < 3 are in good agreement with their results. In 
particular the exponential cutoff mass at 2 < z < 3, 
M£ LL = Mg FG = 10.96, they finc(EI] agrees remarkably 
well with our assumption (see Tab. |6|). Their normal- 
ization 'I'all = ^sfg does not deviate significantly from 
our prediction and the evolutionary trend we suggest is 
also supported by their data (see Fig. [TU]). It is clear, 
however, that future measurements of the stellar mass 
function at z > 1.5 - based on deeper NIR or mid-IR 
data - are critical to confirm the validity of our assump- 
tions. Based on the currently available data we conclude 
that the stellar mass build-up in the SF galaxy popula- 
tion inevitably leads to a shallower decline of the inte- 
grated SFRD between 3 > z > 1 compared to the steep 
decline between 1 > z > 0. 

In order to validate our parameterization of the evolu- 
tion of the integrated SFRD we proceed as follows. First, 
we simply add up all SFRDs measured in different mass 
bins in a given redshift bin (all data points shown in 
the corresponding panels in Fig. |5|) obtaining a lower 
limit to the integrated SFRD at that epoch. Second, 
in order to account for the contribution of the low-mass 
(log(M*) < 9.5) SF population that we cannot directly 
measure, we integrate Eq. (|10j) from our mass-limit down 
to 10 5 Mq. As we discuss in Sec. I5.1l a single value of the 
index a in Eq. (|10j) might not be valid over the entire 
low-mass range as the SSFR-M* relation might flatten 
as soon as an upper limiting SSFR is reached. The up- 
per left panel in Fig. [TT] hence shows the two alternative 
extrapolations and all obtained data are given in Tab. [7j 
The contribution of the integral generally lifts the SFRD 
at a given redshift by a linear factor of ~ 1.4 if we assume 
an SSFR-limit (red filled circles) and ~ 1.7 if a single low- 
mass end slope a is used (red open circles) , suggesting the 
stacking analysis missed ~ 30 % or ~ 40 % respectively 
of the integrated SFRD. The differences between both 
extrapolations are largest at z > 1 below which either 
method yields practically the same results. As pointed 
out in Sec. 15.11 our data cannot clearly rule out any 
of the two alternative low-mass Schechter functions pro- 
posed. Consequently, our extrapolations overlap within 
their individual uncertainty ranges at all redshifts. In 
the following we favor, however, the extrapolation that 



° 7 As Marchesini et al. ( 2009) estimate the mass function based 
on a Kroupa IMF, masses are directly comparable to ours. 
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includes the SSFR-limit as it assures a more conservative 
approach compared to the generally larger values the al- 
ternative extrapolation yields. 

It is evident that the number density corrections at 
our low-mass end discussed in Sec. 15.11 have almost no 
impact on our direct measurements (depicted as black 
open circles in the upper left panel of Fig. ITTj) of the 
SFRDs. Even more, at z < 1.5 practically no correc- 
tions were necessary which highlights again that our in- 
ferences are most robust at these redshifts. It is there- 
fore justified to regard the corrected values (depicted as 
black filled circles in the upper left and lower panel of 
Fig. ITU as a direct and dust unaffected average mea- 
surement of the SFRD for SF galaxies to a lower mass 
limit of M* ~ 3.2 x 10 9 M Q . The evolutionary power- 
law, scaled to our data points and with the indices in 
the two redshift regimes, matches well the observed cos- 
mic star formation history (CSFH) with respect to our 
measured data points presenting lower limits as well as 
to the integrated ones. This may seem surprising as our 
evolutionary model does not take into account differen- 
tial effects with mass as introduced by assuming an SSFR 
limit (case B in Sec. 15. ip . However, since the bulk of the 
mass-integrated SFRD is contained in our direct stack- 
ing based measurements even at high z - which show 
a mass-independent evolution - the model represents a 
good approximation. 

In Fig. [TTJ we also comparer^ our data to the CSFH 
derived from confirmed SF radio sources within the COS- 
MOS field in conjunction with extrapolations based on 
two distinct evolved local radio luminosity functions (see 
ISmolcic et al.ll2009al for details) . This comparison shows 
how good a deep radio survey constrains the CSFH 
and le ads us to slight ly favor the extrapolations based 
on the | Condonl (fl9 89'l radio luminosity function in the 
ISmolcic et al.l (|2009al ) study as already our mass-limited, 
direct, stacking-based measurements of the SFRD on 
their own already reach the values which they inferred 
using the Sa dler et al.l (|2002f) radio luminosity function 
at any z < 1. 

The upper right panel of Fig. [TTJ shows our data alon; 
with other radio-based measurements (Haarsm a et al 
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2Mk [M achals ki fc GodlowsBl27jQ0t iCondon et ; al.ll2002t 
Sadler et al.l l2002t iSerieant et al.l l2002t ISmolcic et al l 
2009al) and the radio-stacking derived CSFH by 



Dunne et al.l {2009). Since the referenced measurements 



b ased on radio d e tection s have been extensively discussed 
in lSmolcic et al.l (l2009af) , we wil l focus our comparison on 
the study of lDunne et al.l (|2009| ) as it is methodologically 
closest to our study. Here the extrapolations towards 
faint sources are based on the evolving K-band luminos- 
ity function with the fixed faint-end slope presented by 
iCirasuolo et al.1 (|2010t ). As we previ o usly p ointed out, 
the evolutionary trends iDunne et al.1 (|2009f ) find are in 
good agreement with our results. Also on absolute scales 
the results of both studies are basically indistinguish- 
able with significant!^! deviations only at the highest red- 

58 All literature data mentioned within the remainder of this 
Section are based on a Salpeter IMF and have been converted to 
the Chabrier scale. 

59 The error bars to the data points shown here — for which 
we assume an upper SSFR limit - are smaller at high redshifts 
compared to those of the corresponding ones if no limit is assumed. 
We might have underestimated the uncertainty introduced by the 



Table 7 

The total SFR density as a function of redshift (cosmic star 
formation history) 

\ SFRD obs (2) [M /yr/Mpc 3 ] SFRD int (^) [M /yr/Mpc 3 ] 
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shifts probed in our study. However, at z > 1.5 the 
trends observed ten d to suggest different conclusions as 
IDunne et al.l ((2009) find a clear peak of the CSFH around 
z = 1.5 — 2 followed by a strong decline of the SFRD 
with redshift. Indeed, at the highest redshifts probed in 
our study the SFRD extrapolated by IDunne et all (|2009h 
does not exceed our direct measurement (without extrap- 
olations, see the top left panel in Fig. [TTJ. Hence, one 
would have to assume that galaxies below ~ 10 10 M Q 
do not contribute at all to the mass-integrated SFRD at 
z > 2 in order to support this peak based on our data. 

Especially when compared to dust extinction cor- 
rected UV-based studies the existence and location of 
the CSFH peak as measured by a dust-unbiased star 
formation tracer is important. Recent UV-based mea- 
surements (|Reddv k. Steidell 120091: IBouwens et al.ll2010l) 
suggest a peak of the CSFH at around z = 2 — 2.5 
and assume that the dust obscuration for the bulk of 
SF galaxies does not dramatically change out to z ~ 4 
(as shown by e.g. IBouwens et al. 2009; Finkclstei n et al.l 
[2001 iMcTure et alj|201tt iWilkins et~al.ll2010D . Our ris- 
ing SFRD at z > 2 suggest that SF galaxies at these 
redshifts have a somewhat higher dust content or obey a 
different reddening law than the corresponding sources at 
lower redshifts. Whether dust-obscured sources at z > 2 
are lost in optical/UV based measurements of the CSFH 
cannot be definitely answered given the mentioned large 
error bars our data show. Future Herschel studies of the 
total IR luminosity evolution at z > 2 should reveal po- 
tentially larger dust reservoirs in these systems. The even 
more rapid decline of the radio-based SFRD as derived 

extrapolations in the former case because of the assumed error to 
the dynamical timescale (370±50 Myr) which is the pu rported limit 
to the inverse SSFR. However, it should be noted that IDunne et aT] 
(2009) do not include any uncertainty caused by their extrapolation 
into their error budget. 
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Figure 11. Upper left: The cosmic star formation history (CSFH) out to z = 3 from the VLA-COSMOS survey. Black circles (raw and 
number density-corrected) represent the sum of the data points in all redshift-bin panels of Fig. [8] and hence a direct - stacking-based - 
measurement of the SFRD down to the limiting mass at each epoch. Evidently, the number density corrections are always small and no 
corrections are necessary at z < 1.5 where our results are most robust. Red filled circles correspond to the 'total' SFRD at each epoch, 
obtained by in tegrating the Sch echter-function fit (Fig. [§) down to M* = 10 5 Mq and assuming an upper limit to the average SSFR (case 
B in Sec . 15.11 see also Sec. I4.1l for details). Red open circles are obtained by the same method assuming no upper SSFR-limit (case A in 
Sec. 15.11 1. The redshift evolution can be described by a broken power-law (dashed lines) that results from the joint (non-)e volution of the 
SF stel lar mass density and the evolution of the (S)SFR-sequence. Down- and upward-facing triangles depict the results by I Smolcic et all 
(2009a) based on VLA-COSMOS radio detections extrapolated by two distinct radio luminosity functions (LF). Upper right: Compilation 
of radio-based literatu re estimates of the integrated CSFH between < z < 4, compared to our results . Th e radio-stacking based results 
bv lDunne et al.1 120091) are depicted in grey and suggest a clear peak of the CSFH at z ~ 1.5 (see Sec. 15.21 for a full list of references and 
discussion). Bottom: Mi d- to far-IR measurements of SFRDs between < z < 2.5 along with our data and the 3<r envelope from the 
Hopkins & Beacom (2006) compilation. The Herschcl/PACS-based results (Gruppioni et al. 2010) are lower limits at z > 1.2 and should 
be compared to our non integrated measurements (filled black circles). Note the remarkable agreement of the IR- and radio-based data at 
all z. 
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by iDunne et all ([20 09') appea rs to support the picture 
drawn by e.g. iBouwens etall ()2010l ) that obscured star 
formation does not significantly contribute to the global 
SFRD at z ^> 4. Such conclusions should, however, be 
treated with caution as it is highly unclear at such early 
cosmic times what extrapolation to the directly measured 
radio derived SFRD s are needed given th e high stellar 
mass limits (see also iGallerani et "aLll2010f ). 

Our integrated CSFH is fur ther supported by recent 
studies car ried out at mid-IR ([Rodighiero et all feoiOb') 
and far-IR ([Gruppioni et al.ll2010l ) wavelengths. The lat- 
ter study is based on ~ 210 - 240 Herschel/PACS de- 
tections at 100 and 160 /xm in 150 arcmin 2 within the 
GOODS-N field and provides us with a deep view of the 
dust-unbiased CSFH. The lower panel in Fig. ITTI shows 
that the agreement of the Her schel-based results pre- 
sented by I Gruppioni et all (|2010l ) with our radio-stacking 
derived mass-integrated CSFH is striking. Out to z ~ 1 
we also find a broad agre ement w ith the measurements o f 
iRodighiero etall (|2010bD . While iGruppioni et all <f2010h 
show only lower limits at z > 1 below this redshift both 
studies measure an evolution of (l+z) n with n = 3.8±0.3 
(0.4) A recent 24 fj,m based study by iRuiopakarn et all 
(2010) confirms this result measuring n = 3.4 ± 0.2. 
All these values are in remarkable agreement with our 
average measured evolution of the SSFR-sequence of 
(nsFc) = 3.5 ± 0.02 (see Sec. Hand Tab. 00 and hence 
a strong support for both our measurement as well as 
our parameterization given in Eq. (|13l) , especially out to 
z = 1. It is, however, worth mentioning that our work, 
compared to the other studies mentioned, draws on a far 
larger sample. 

Finally we want to stress the fact that any shallower 
high redshift trend in the evolution of the SSFR-sequence 
also at the high-mass end would indeed lead to a decline 
in the evolution of the SFRD as Eq. (fj"3|) suggests. This 
scenario cannot be ruled out given our data as the SSFR 
at the low-mass end of our sample tends to flatten and the 
high-mass end might follow at slightly higher redshifts 
based on the dynamical time arguments presented in Sec. 
14.21 that result in a global upper limit to the average 
SSFR. Hence, again, a deeper mid-IR selected sample 
of SF galaxies is needed to accurately probe the regime 
above z = 1.5 where the CSFH is supposed to peak. 

6. SUMMARY AND CONCLUSION 

Based on an unprecedentedly rich sample of galaxies 
selected at 3.6 fxm with panchromatic (FUV to mid-IR) 
ancillary data and mapped in 1.4 GHz radio continuum 
emission in the COSMOS field we have measured stellar 
mass-dependent average (specific) star formation rates 
((S)SFR) in the redshift range 0.2 < z < 3. These were 
obtained using a median image stacking technique that 
is best applied in the radio regime where the angular 
resolution is high and the fraction of direct detections is 
comparatively low such that blending of sources is neg- 
ligible. 

We individually measured integrated radio flux densi- 
ties in each stacked image and showed that a uniform 
(i.e. mass-independent) correction factor is inappropri- 

60 Note that the scatter of the individual measurements at dif- 
ferent redshifts of /?sfg stated in Tab. \E\is actually a more realistic 
uncertainty range than the formal error to their weighted mean. 



ate to convert between peak and total flux density when 
high angular resolution radio continuum data is used. 
Furthermore, we applied various criteria in order to min- 
imize the impact of contaminating radio flux density from 
active galactic nuclei and discussed to which lower mass 
limit at a given redshift our sample remains represen- 
tative with respect to the star formation properties of 
the underlying population. We emphasize that all our 
findings are to be regarded as most robust at z < 1.5 
while our data place valuable constraints on evolution- 
ary trends at the highest masses as far as z — 3. 

Using the template-based, rest-frame (NUV — r + ) tem p 
color from SED-fits in the NUV-mid-IR, we separate SF 
galaxies from quiescent systems in order to study the av- 
erage mass dependence of their SSFR at all epochs con- 
sidered. We also discussed potential effects introduced 
by such a color threshold, such as mimicking a potential 
upturn of the SSFR-M* relation. 

Our findings are summarized as follows: 

1. The massive end of our global sample of mass- 
selected galaxies (including quiescent and SF sys- 
tems) shows a power-law relation between SSFR 

and stellar mass (SSFR oc M£) with an index of 
roughly —0.7 < /3all iS —0.6 and a trend towards 
shallower indices with increasing redshift. Towards 
lower masses the relation appears to flatten, prob- 
ably because quiescent galaxies with low SSFRs 
preferentially occupy the massive end of the nor- 
mal galaxy population. 

2. For a given stellar mass we report a strong increase 
of the SSFR with redshift that is best parametrized 
by a power-law oc (1 + z) 4,3 . 

3. The relation between SSFR and mass for star form- 
ing (SF) systems only (referred to as the SSFR- 
sequence) evolves as (1 + z) 3 - 5 and shows a shal- 
lower power-law index of /3sfg ~ —0.4 because qui- 
escent galaxies do not lower the observed average 
SSFRs at the high-mass end anymore. The param- 
eter /3sfg does not significantly change with cosmic 
time so that the average SSFR is best described by 
a separable function in mass and redshift (Eq. [5] in 
Sec. EM- 

4. Towards lower masses and z > 1.5 also the SSFR- 
scqucncc itself tends to flatten which might be ex- 
plained by an upper limiting threshold where aver- 
age SF systems already reach levels of star forma- 
tion that qualify them to double their mass within 
a dynamical time. It is plausible that the SSFR 
at a given time does not continue to increase till 
the regime of dwarf galaxies at the rate predicted 
by our power-law index. We, however, cannot rule 
out that low-mass systems with high star formation 
activity but also very high dust content are missed 
given the limiting magnitude in our selection band. 

We firmly conclude that, out to z ~ 1.5, our results 
hence neither support the so-called 'SSFR-downsizing' 
nor '-upsizing' scenarios proposed by some earlier work 
while they do confirm the downsizing scenario in the fol- 
lowing take: 
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The SFR declines strongly but in a mass-independent 
fashion while the most massive galaxies always show the 
least star formation activity and are hence the first to fall 
below their past-average star formation activity. 

By taking advantage of the simple functional form of 
both the (S)SFR-sequence and the mass function of SF 
galaxies in the redshift range we study we have shown 
that 

• the mass distribution function of the comoving SFR 
density (SFRD) at any redshift below z = 1 is well 
parameterized by a single Schechter function with 
a possible low-mass modification at higher z. 

• the typical mass of a SF galaxy contributing most 
to the total (stellar mass integrated) SFRD is 
-^q10.6±o.4 with no evidence for evolution out 
to z — 3. 

Out to z Ri 1 the evolution of the integrated SFRD, 
in turn, is entirely controlled by the mass-uniform evolu- 
tion of the SSFR-sequence as the number of SF galaxies 
in a given comoving volume does not change anymore. A 
strong and global decline in the mass density of molecu- 
lar gas, i.e. the reservoir out of which stars are formed, 
appears therefore to be the only driver of the observed 
decrease of the integrated SFRD with cosmic time. The 
rate at which the SFRD declines is in excellent agree- 
ment with the most recent other studies that use mid- to 
far-IR emission as an alternative dust-unbiased tracer for 
star formation. Towards earlier epochs this steep trend 
becomes shallower as the comoving stellar mass density 
of SF systems decreases. In other words, there are simply 
less (SF) galaxies at z > 1 while their individual SFRs 
further increase with redshift. This statement is certainly 
valid for galaxies more massive than 10 10 Mq which dom- 
inate the CSFH at all epochs out to z = 3. Our results do 
not suggest any change of this trend towards the highest 
rcdshifts probed but it should be emphasized again that 
our data cannot constrain the situation as strongly as at 
z < 1.5. Hence, we do not rule out that the CSFH peaks 
in this redshift range. Indeed, the constancy of the SSFR 
at z ^> 2 suggested by other studies and motivated by 
the dynamical time threshold we discuss would give rise 
to a decline of the global SFRD at such high redshifts. 
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APPENDIX 

A STATISTICAL ESTIMATOR FOR THE STELLAR MASS REPRESENTATIVENESS OF A FLUX DENSITY-LIMITED 

SAMPLE 

In principle one could roughly estimate stellar mass completeness limits by visual inspection of Fig. [TJ Given a flux 
density limit (i.e. F3.6 ~ 5 /LtJy for all and -F3.6 ^ m ~ 1 A*Jy f° r SF galaxies; see Sec. I2.6[) below which no objects 
of a given spectral type (Sec. 12. 4|) should be considered one would select by eye a stellar mass limit upward from 
where there are no objects below the flux density threshold. As pointed out in Sec. 12.61 it is, however, necessary to 
analytically derive these stellar mass limits in order to ensure that - within a narrow mass-range just at any limiting 
mass - we are dealing with a distribution of flux densities that can be considered representative for the one of the 
underlying population. A statistical estimator is needed for obtaining the actual level of representativeness we achieve 
at a given stellar mass. 

Our aim is to compare the properties of the exponential decline of the observed distribution of 3.6 /im flux densities 
- i.e. the distribution of low values of flux density towards the flux density limit in our selection band - within a 
narrow bin in logarithmic stellar mass to the analogously exponentially declining Gaussian distribution. As explained 
in the following it is sufficient for this comparison to derive the relative distance between (a) the 0.95 percentile of 
the observed distribution of flux densities to the flux density limit and (a) the 0.9 to the 0.95 percentile of the same 
observed distribution. The choice of the two percentiles mentioned is hereby entirely arbitrary. 

First we have to derive the corresponding ratio of distances for an arbitrary normal distribution that is cut at a given 
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Figure 12. Analytic evaluation of the statistical 95 % statistical completeness in different redshift bins, based on a flux density threshold 
of F3.6 |im = 5 fijy (?tiab(3.6 jim) = 22.15) corresponding to the 90 % level of intrinsic catalog completeness for all galaxies (black circles). 
For star forming galaxies (blue stars) the evaluation is based on a flux density threshold of F3.6 Mm = 1 /ijy (t?iab(3.6 ^m) = 23.9, i.e. the 
magnitude limit of the catalog). For a detailed discussion of the meaning of statistical co mple teness, the choice of flux density thresholds 
and the implications on sample representativeness with respect to star formation see Sec. 12.61 The photometric redshift slices depicted in 
the individual panels are the same as in Fig. [l] The quantity {x/y) h s measures how well the the distribution of flux densities at the faint 
end within a given sample in a narrow mass bin (Alog(Af*) = 0.2) follows the decreasing wing of a Gaussian distribution. Where it crosses 
the dashed horizontal line the Gaussian is cut at the 0.95 percentile and the anticipated statistical completeness limit (vertical lines) is 
reached. The method is described in detail in Appendix [A] Lines connecting the data points are meant to guide the eye. 



percentile. This percentile sets the representativeness we want to achieve, i.e. 0.95 in our case. Since the width of 
and hence the lengthscale defined by a Gaussian is determined by a single parameter a any ratio of distances between 
given percentiles is independent of the actual value of a or the normalization. Given for instance a quantity x defined 
as the distance between the 0.9025 (— 0.95 x 0.95) percentile of a given normal distribution and its 0.95 percentile 
as well as a quantity y defined as the distance from the 0.855 (= 0.95 x 0.9) to the 0.9025 percentile of the same 
distribution, their ratio (x/y)Ga, uss yields a value of 1.467 that is universal, i.e. independent of the actually chosen 
normal distribution. It was obtained by taking advantage of the cumulative distribution function that connects a 
percentile to the corresponding actual value x a defined by the specific Gaussian of width a centered at n via the error 
function (erf): 



1 + erf 



aV2 



(Al) 



Differences in percentiles A$ Mj(T = ^ ^^{x a j) — $ ^^{x a j) thus translate into physical distances Acc CT = x a j — x a i 
solely defined by the scale a. 

Assuming that our data in narrow bins of stellar mass and redshift follows a normal distribution the distance from 
the 0.95 percentile of the observed distribution of 3.6 /im flux densities to the flux limit of the sample yields a value x bs 
in units of flux density. Accordingly the distance between the 0.9 and the 0.95 percentiles of the observed distribution 
defines a value y b s and the dimensionless ratio (x/y) h s = ^obs/2/obs can be compared to the aforementioned value of 
(a;/y)Gauss- Given the case that the flux density limit is located far in the tail of the observed distribution, (x/y) h s 
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will exceed (x/y)G auss and the observed distribution is statistically representative of the underlying population of 
objects. As the flux density limit approaches the peak of the observed distribution the observed ratio becomes lower. 
As soon as it overlaps with the 0.95 percentile of the (unknown) distribution of the underlying population the limiting 
case of 95 % statistical completeness - and hence the desired lower level of representativeness - is reached so that 
{x/y)ohs = (x/y)G a uss- 

For our sample this effect is shown in Fig. 1 121 where (x/y) Gauss is indicated as a dashed horizontal line for individual 
ranges in photometric redshift. The finally chosen stellar mass representativeness limits are denoted by vertical lines. 
The data points result from an implementation of the method described in this section that additionally takes into 
account the detection completeness levels of the catalog as a function of 3.6 /im flux density. Here we therefore obtain 
the mentioned percentiles using flux densities weighted by the corresponding inverse catalog detection completeness. 

It is worth noting that the Gaussian distribution is just one possible parameterization and not a necessary requirement 
for the method described here. Indeed, the underlying distribution of flux densities is not even required to be symmetric. 
Our method simply ensures that the observed distribution is smoothly, approximately exponentially declining to low 
levels of flux density, as one may realistically expect from random processes such as photon noise and confusion. 
It is simply a practical, quantitative improvement over the alternative method of visual inspection as the latter is, 
essentially, assuming an unphysical step-function rather than a continuos distribution function. 

STATISTICS 

In the following a set of N pixels will always be written as Xn, regardless if the constituents Xi (i = 1, . . . , N) are 
noise pixels or a sample of peak flux densities. We will specify at any stage, if we are referring to background noise, 
in which case we will use the upper case indication 'bg'. 

Noise weighted estimators 

Due to the non-uniform noise distribution in the VLA-COSMOS map, the input samples used for stacking are ill- 
defined to some extent. Solely discarding the high-noise edge regions does not remedy the fact, that there is significant 
variation of the rms background noise in the cutout postage stamps originating from a broad spatial distribution 
across the field. Our aim is to find the best estimator for the representative value of the underlying population. 
Therefore the sample should consist of a random and independent set of sources drawn from this population under 
equal circumstances. To approach the last condition, that is not achievable in observational reality, we have to compare 
the outcome of the stacked sample to that of a weighted sample, in which those stamps gain more influence, that lie 
in low noise regions. 

Regarding the mean-stacking technique it is statistically well known, that appropriate weights are found in the 
reciprocal variance of each particular stamp's noise pixel sample where the variance of the ith stamp is defined as 

Vari = Var (x^tg) = o\ % (x^) . As explained above X°^\ = ja^ 8 , . . . , x^ E j. 

The noise- weighted mean of the sample Xn of peak fluxes can thus be considered the mean of the weighted sample 
Xn, where the constituents Xi of Xn are defined as 

Xi — WiXi = N 1 _, Xi where Xi € Xn. (Bl) 

E* Var ; 

With the definitions Wi = Var;" 1 and Wi = Wi/ J^i^i it is easily shown, that the mean of the Xi defined in (|Blj) 
indeed equals the noise- weighted mean of the x^. 

(X) = 1 £ s, = 1 £ ffi,*, = £ WiXi = (B2) 

The above discussion leads to the suggestion, that in the presence of varying rms-noise in a given sample, the sample 
Xn is the appropriate one to consider not only with respect to the mean value of the sample. It seems reasonable, that 
also its median is the best estimator for the median of the underlying population, because both computed quantities, 
the median and the mean, are then referring to the same sample. We will refer to this choice of an estimator in the 
following as a noise- weighted median. 

Bootstrapping 

In the above discussion we justified that neither the observed nor the intrinsic distribution of peak fluxes are expected 
to be gaussian. This needs to be taken into account no matter if we are looking for an appropriate uncertainty range 
to the median or mean estimator for a given sample. In order to obtain a 68 % confidence interval not relying 
on normality of the underlying parent distribution we therefore chose a bootstrapping technique for the statistical 
parameter of choice. 

In each case we obtain the limits of the confidence interval by a bootstrapped Student's t-distribution. This technique 
is called studentizing. A (1 — a) confidence interval for a parameter X in traditional statistics is given by 

CI a /2 — X ± t a / 2 —¥= , '■ standard deviation of A". (B3) 

V N 
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Here t a /2 denotes the a/2 percentile of the classical ^-distribution which is equal to the (1 — a/2) percentile due to the 
symmetry of Student's distribution. Bootstrapping a t-distribution means to circumvent the assumption of a normally 
distributed population by deriving the quantity 

for i = 1, . . . , iVbootstrap samples drawn from the original sample of peak fluxes with replacement. In case of X = (X) 
being the sample mean one thus has to compute the sample mean (X) as well as all means of the bootstrapped samples 
(X*)i including standard deviations s* x ,y. The resulting distribution of iV bootstrap i*-values is then used to compute 

the upper and lower confidence limits by taking its a/2 and (1 — a/2) percentiles: 



Cli; a = X + t* a/2 -Z= (B5) 



s 2 

ClL a ^-tl_ a/2 ^, (B6) 

where in general we chose 1 — a = 0.68 obtaining thus a 68 % confidence interval. Here = s/x) still is just the 
standard deviation of the original sample. In case of X = Med (X) denoting the sample median as the parameter of 
choice we have to face the problem that the denominator of (|B4[) does not provide us with an estimator of the standard 
error of the median. In order to estimate this latter quantity we need to access the empirical standard deviation of a 
sample of medians being representative for the median of the current bootstrapped sample. Starting from this sample 
we thus generate a number of new bootstrapped samples hence performing a bootstrapping within the bootstrapping 
procedure^. The standard deviation s* Med ^ x ,,^ of these subsamples' medians is then used as an estimator for the 



standard error of the single outer bootstrapped median as given by the denominator of Eq. (|B4|) . In order to use Eq. 
(|B5[ IB6|) we furthermore need the standard error of the original sample's median. This is estimated by computing the 
median of each outer bootstrapped sample and taking the standard deviation s Med ( X *) of this sample of medians as 
the standard error. 

SELECTION OF STAR FORMING GALAXIES 

Because we cannot measure radio-based SSFRs for individual galaxies, selecting the SF population directly in the 
SSFR-M* plane is impossible. In Sec. 12.41 we argue that the intrinsic (dust-extinction corrected) rest-frame NUV-r 
color is a reliable way to select SF galaxies (see also 110). In this section, we test our color selection in two ways 
in order to demonstrate its fidelity and to assess how our findings relate to previous measurements in the literature: 
we (1) choose a bluer color-cut to study the ensuing changes in the e volution of the SS FR-M* relation, and we (2) 
compare both color cuts to the BzK-selection of SF galaxies at high z (iDaddi et al. 2004). 

Highly active star forming galaxies 

110 have shown that the color selection criterion (NUV — r + )t cm p < 1-2 leads to a morphologically clean sample of 
late-type spiral and irregular galaxies with template SED-based SSFRs that are clearly separated from the passive 
population (see Sec. I2.4j) . This color threshold is somewhat arbitrary (as it is less well motivated than the cut 
we applied to select SF systems) but by virtue of being substantially bluer than our original choice it minimizes 
contamination by passive galaxies. 

We derived SSFRs as a function of redshift and mass in the same way as before for galaxies with (NUV-r + ) tcmp < 1.2. 
Although the exclusion of systems with intermediate star forming activity has reduced the sample size considerably, 
it was still possible to cover the same dynamic ranges. Only the binning scheme has been slightly modified for this 
strongly star formation population (see Fig. f]~3)) . Its SSFRs usually are significantly higher compared to our original 
choice of SF galaxies, the slope f3 of the SSFR-M* relation is shallower at low to intermediate redshifts and thus 
in excellent agreement with those literature results for SF galaxies discussed in Sec. 14.41 that report an almost flat 
SSFR-sequence. At the high- z end we see a steepening of the slope j3 (i.e. an 'upsizing' tre nd), similar o r even more 
evident to what was found by rRodighie ro et al.1 (|2010al ) and, at lower significance , also bvlOliver "eTaLl (pOlH . The 
evolutionary exponent n is consistent with previous measurements as well (see e.g. iPannella et al.ll2009l ). 

The bluer color threshold hence is able to reproduce most literature findings albeit with SSFRs that tend to be 
comparativey high, especially at low redshift. However, it has yet to be confirmed that the galaxy population selected 
in this way is representative of the entire SF population. 

( s )BzK galaxies at z ~ 2 

We cross-matched o ur 3.6 ^m selected catalog with the K band selected catalog for the COSMOS field 
(jMcCracken et all 120101 ) and thu s obtain a magnitu de calibration in the crucial wavebands that allows us to ap- 
ply the BzK selection criterion of IDaddi et al.l ()2004l ). Fig. [T4l shows the BzK diagram for our sample, with galaxies 

61 The number of outer bootstrapped samples is typically chosen , . . 

to be an order of magnitude larger compared to the one of the inner °° S ra PP ln S- 
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Figure 13. The SSFR-M* relation (upper left) and time evolution of SSFRs in various mass-bins (upper right) for galaxies with high star 

formation activity ((NUV-r+)tcmp < 1.2). Dashed lines (color coded by redshift) denote two-parameter fits of the form c X to the mass 
complete data (filled circles). The dynamical ranges are the same as in Figure \E\ and [6] where the other quantities shown are explained. 
The color threshold is substantially bluer and rather arbitrary compared to the one used for the selection of SF systems. Compared to 
the results of the entire SF sample the radio stacks yield a flatter SSFR-sequence out to 2 ~ 1.5 (/3 ~ —0.08 ± 0.05) and a mild 'upsizing' 
trend (lower panel) while overall a shallower evolution of the SSFR <x (1 + 2 ) 2 -3±0-3 j s found for this sample of most vigorously SF galaxies 
(upper right, where fits to the mass-complete data in the different mass bins are depicted as dashed black lines). All these trends seen for 
highly active SF galaxi es ar e hence a result of a simple selection effect. The inverse horizontal red band sketches the inverse dynamical 
time as detailed in Sec. 14.11 

in six redshift slices color coded according to their (NUV — r + ) te mp color as described in Sec. 12.41 Our 'star forming' 
sample is the union of all galaxies plotted in blue and green. 

At z > 1.5, the sBzK criterion (all galaxies to the left of the diagonal line in each panel) is established to efficiently 
select normal SF systems. Fig. [14] illustrates that the selection window for sBzK galaxies is populated by both the 
most actively star formation sources (blue dots) and the majority of the sources with intermediate star forming rates 
(plotted in green; i.e. the rest of the SF sample used throughout this article). Only a small number of objects with 
moderate SF activity fall into the passive BzK region i n the upper right of each panel. This is the reason for the 
aforementioned excellent agreement of our results with iPannella et alj (|2009fl at z w 2.1; their and our sample are 
virtually indistinguishable and both studies rely on the same radio data. More importantly, however, the BzK diagram 
strongly supports our original selection of SF objects in the crucial redshift regime z > 1.5, where we have just shown 
that previously reported changes in the slope (3 can be mimicked by simply selecting only very blue objects, hence, 
most actively forming systems. 




Figure 14. BzK diagram of our sample in various redshift bins. The color coding refers to our choice of the (NUV — r+)temp color 
threshold in order to prede fine systems with h igh (blue), intermediate (green) and negligible (red) star formation activity. In the lowest 
redshift panel the original (Daddi ct al. 2004) sBzK criterion (diagonal line) does not appear to be efficient enough in selecting all SF 
galaxies and is particularly missing the systems with intermediate levels of star formation. At higher redshifts our SF sample (all green 
and blue sources) overlaps very well with the sBzK population so that our color selection for the purpose of radio stacking is appropriate 
to select normal SF systems at z > 1.5. 
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